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FIELD OF THE INVENTION 



The invention relates generally to non-naturally occurring novel proteins which 
have insecticidal properties, and more specifically to the design, preparation, and use of 
proteins that have been deallergenized while maintaining then- insecticidal properties. 
Dealiergenized patatin proteins include variants that have had allergenic sequences 
modified, and permuteins that have had their amino acid sequences rearranged at one or 
more breakpoints. 

BACKGROUND OF THE INVENTION 
Insecticidal proteins 

The use of natural products, including proteins, is a well known method of 
controlling many insect, fungal, viral, bacterial, and nematode pathogens. For example, 
endotoxins of Bacillus thuringiensis {B.t.) are used to control both lepidopteran and 
coleopteran insect pests. Genes producing these endotoxins have been introduced into 
and expressed by various plants, including cotton, tobacco, and tomato. There are, 
however, several economically important insect pests such as boll weevil (BWV), 
Anthonomus grandis, and com rootworm (CRW), Diabrotica spp. that are not as 
susceptible to B.t. endotoxins as are insects such as lepidopterans. In addition, having 
other, different gene products for control of insects which are susceptible to B.t. 
endotoxins is important, if not vital, for resistance management. 

It has been recently discovered that the major storage protein of potato tubers, 
patatins (Gaillaird, T., Biochem. J. 121: 379-390, 1971; Racusen, D., Can. J. Bot., 62: 
1640-1644, 1984; Andrews, D.L., et al, Biochem. J., 252: 199-206, 1988), will control 
various insects, including western rootworm (WCRW, Diabrotica virigifera), southem 
com rootworm (SCRW, Diabrotica undecimpunctata), and boll weevil (BWV, 
Anthonomus grandis) (U.S. Patent No. 5,743,477). Patatins are lethal to some larvae and 
will stunt the growth of survivors so that maturation is prevented or severely delayed, 
resulting in no reproduction. These proteins, have nonspecific lipid acyl hydrolase 
activity and studies have shown that the en2yme activity is essential for its insecticidal 
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activity (StrickJand, J.A., et al. Plant Physiol, 109: 667-674, 1995; U.S. Patent No. 
5,743,477). Patatins can be applied directly to the plants or introduced in other ways well 
known in the art, such as through the application of plant-colonizing microorganisms, 
which have been transformed to produce the enzymes, or by the plants themselves after 
similar transformation. 

In potato, the patatins are found predominantly in tubers, but also at much lower 
levels in other plant organs (Hofgen, R. and Willmitzer, L., Plant Science, 66: 221-230, 
1990). Genes that encode patatins have been previously isolated by Mignery, G.A., et al. 
{Nucleic Acids Research, 12: 7987-8000, 1984; Mignery, G.A., et al. Gene, 62: 27-44, 
1988; Stiekema, et al. Plant Mol. Biol., 11: 255-269, 1988) and others. Patatins are 
foimd in other plants, particularly solanaceous species (Ganal, et al, Mol Gen. Genetics, 
225: 501-509, 1991; Vancanneyt, et al. Plant Cell, 1: 533-540, 1989) and recently Zea 
mays (WO 96/37615). Rosahl, et al. {EMBO J., 6: 1155-1159, 1987) transferred it to 
tobacco plants, and observed expression of patatin, demonstrating that the patatin genes 
can be heterologously expressed by plants. 

Patatin is an attractive for use in planta as an insect control agent, but 
unfortunately a small segment of the population displays allergic reactions to patatin 
proteins, and in particular to potato patatin, as described below. 

Food allergens 

There are a variety of proteins that cause allergic reactions. Proteins that have 
been identified as causing an allergic reaction in hypersensitive patients occur in many 
plant and animal derived foods, pollens, fungal spores, insect venoms, insect feces, and 
animal dander and urine (King, H.C., Ear Nose Throat J., 73(4): 237-241, 1994 ; 
Astwood, J.D., et al., Clin. Exp. Allergy, 25: 66-72, 1995; Astwood, J.D. and Fuchs R.L., 
Monographs in allergy Vol 32: Highlights in food allergy, pp. 105-120, 1996; Metcalfe, 
D.D., et al, Critical Reviews in Food Science and Nutrition, 36S: 165-186, 1996 ). The 
offending proteins of many major sources of allergens have been characterized by clinical 
and molecular methods. The functions of allergenic proteins in vivo are diverse, ranging 
firom enzymes to regulators of the cell cytoskeleton. 
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1 To understand the molecular basis of allergic disease, the important IgE binding 

2 epitopes of many allergen proteins have been mapped (Elsayed, S. and Apold, J., Allergy 

3 38(7): 449-459, 1983; Elsayed, S. et al, ScandJ. Clin. Lab. Invest. Suppl. 204: 17-31 

4 1991; Zhang, L., et al., Mol. Immunol. 29(11): 1383-1389, 1992). The optimal peptide 

5 length for IgE binding has been reported to be between 8 and 12 amino acids. 

6 Conservation of epitope sequences is observed in homologous allergens of disparate 

7 species (Astwood, J.D., et al, Clin. Exp. Allergy, 25: 66-72, 1995). Indeed, conservative 

8 substitutions introduced by site-directed mutagenesis reduce IgE binding of known 

9 epitopes when presented as peptides. 

10 Food allergy occurs in 2-6 % of the population. Eight foods or food groups (milk, 

11 eggs, fish, Crustacea, wheat, peanuts, soybeans, and tree nuts) account for 90% of 

12 allergies to foods. Nevertheless, over 160 different foods have been reported to cause 

13 adverse reactions, including potato (Hefle, S., et al, Crit. Rev. in Food Sci. Nutr., 36S: 

14 69-90, 1996). 

15 Mode of action of allergens 

16 Regardless of the identity of the allergen, it is theorized that the underlying 

17 mechanism of allergen response is the same. Immediate hypersensitivity (or anaphylactic 

18 response) is a form of allergic reaction which develops very quickly, i.e., within seconds 

19 or minutes of exposure of the patient to the causative allergen, and is mediated by B 

20 lymphocyte IgE antibody produciton. Allergic patients exhibit elevated levels of IgE, 

21 mediating hypersensitivity by priming mast cells which are abundant in the skin, 

22 lymphoid organs, in the membranes of the eye, nose and mouth, and in the respiratory 

23 tree and intestines. The IgE in allergy-suffering patients becomes bound to the IgE 

24 receptors of mast cells. When this bound IgE is subsequently contacted by the 

25 appropriate allergen, the mast cell is caused to degranulate and release various substances 

26 such as histamine into the surrounding tissue (Church et al. In: Kay, A.B. ed.. Allergy 

27 and Allergic Diseases, Oxford, Blackwell Science, pp. 149-1 97, 1997). 

28 It is the release of these substances which is responsible for the clinical symptoms 

29 typical of immediate hypersensitivity, namely contraction of smooth muscle in the 

30 airways or in the intestine, the dilation of small blood vessels, and the increase in their 
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permeability to water and plasma proteins, the secretion of thick sticky mucus, and (in the 
skin) the stimulation of nerve endings that result in itching or pain. Immediate 
hypersensitivity is, at best, a nuisance to the suffer; at worst it can present very serious 
problems and can in rare cases even result in death. 

Allergic reactions to potato 

Food allergy to potato is considered rare in the general population (Castells, M.C., 
et al, Allergy Clin. Immunol, 8: 1110-1114, 1986; Hannuksela, M., et al. Contact 
Dermatitis, 3: 79-84, 1977; Colbert, T.M., et al. Journal of Allergy, 44: 96-107, 1969). 
Approximately 200 individuals have participated in published clinical accounts of potato 
allergy (Hefle, S. et al. Critical Reviews in Food Science and Nutrition, 36S: 69-90, 
1996). A number of IgE binding proteins have been identified in potato tuber extracts 
(see Table 1), however the amino acid sequence and function of these proteins has not 
been determined (Wahl, R., etal, Intl Arch. Allergy Appl Immunol, 92: 168-174, 1990). 



Table 1 : Studies of potato tuber IgE-binding proteins (allergens) 



Study 


Protein Characteristics 


(Castells, M.C. et al J. Allergy Clin. Immunol 78, 1110- 
1114, 1986) 


Unknown 14 to 40 kDa 


(Wahl, R. et al Int. Arch. Allergy Appl Immunol 92: 
168-174, 1990) 


Unknown 42/43 kDa 




Unknown 65 kDa 




Unknown 26 kDa 




Unknown 20 kDa 




Unknown 14 kDa 




Unknown < 14 kDa (~ 5 kDa) 


(Ebner, C. et al in: Wuthrich, B. & Ortolam, C. (eds.), 
Highlights in food allergy. Monographs in Allergy, 
Volume 32 Basil,Karger, pp. 73-77, 1996) 


Unknown 42/43 kDa 




Unknown 23 kDa 




Unknown - 16 kDa 




Unknown < 14 kDa (~ 5 kDa) 



Improved safetv from the use of hvpoallereenic proteins 



Patatin has been identified as an allergenic protein (Seppala, U. et al., J. Allergy 
Clin. Immunol. 103:165-171, 1999). Accordingly, potato allergic subjects may not be 
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able to safely consume products containing unmodified patatin protein, such as crops to 
which foliar applications of patatins have been applied, or crops which have been 
engineered to express patatin. In addition, proliferation of food allergens in the food 
supply is considered hazardous (Metcalfe, D.D., et al.. Critical Reviews and Food 
Science and Nutrition, 36S: 165-186, 1996). There are additional concerns regarding the 
use of potentially allergenic food proteins where workers might be exposed to airborne 
particulates, initiating a new allergic response (Moneret-Vautrin, D.A., et al. Rev. Med. 
Interne.. 17(7): 551-557, 1996). 

Permuteins 

Novel proteins generated by the method of sequence transposition resembles that 
of naturally occurring pairs of proteins that are related by linear reorganization of their 
amino acid sequences (Cunningham, et al. Proc. Natl. Sci., U.S.A., 76: 3218-3222, 1979; 
leather, et al., J. Bacterial., 172: 3837-3841, 1990; Schimming, et al., Eur. J. Biochem., 
204: 13-19, 1992; Yamiuchi, et al., FEBS Lett., 260: 127-130, 1991; MacGregor, et al., 
FEBS. Lett., 378: 263-266, 1996). The first in vitro application of sequence 
rearrangement to proteins was described by Goldenberg and Creighton (Goldenberg and 
Creighton, J. Mol. Biol, 165: 407-413, 1983). A new N-terminus is selected at an 
internal site (breakpoint) of the original sequence, the new sequence having the same 
order of amino acids as the original from the breakpoint until it reaches an amino acid 
that is at or near the original C-terminus. At this point the new sequence is joined, either 
directly or through an additional portion or sequence (linker), to an amino acid that is at 
or near the original N-terminus, and the new sequence continues with the same sequence 
as the original until it reaches a point that is at or near the amino acid that was N-terminal 
to the breakpoint site of the original sequence, this residue forming the new C-terminus 
of the chain. This approach has been applied to proteins which range in size from 58 to 
462 amino acids and represent a broad range of structural classes (Goldenberg and 
Creighton, J. Mol. Biol, 165: 407-413, 1983; Li and CofFmo, Mol. Cell. Biol, 13: 2377- 
2383, 1993; Zhang, et al. Nature Struct. Biol, 1: 434-438, 1995; Buchwalder, et al. 
Biochemistry, 31: 1621-1630, 1994; Protasova, et al, Prot. Eng., 7: 1373-1377, 1995; 
Mullins, et al,J. Am. Chem. Soc, 1 16: 5529-5533, 1994; Garrett, et al. Protein Science, 
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1 5: 204-211, 1996; Hahn, et al, Proc. Natl. Acad. Sci. U.S.A., 91: 10417-10421, 1994; 

2 Yang and Schachman, Proc. Natl. Acad. Sci. U.S.A., 90: 1 1980-11984, 1993; Luger, et 

3 al.. Science, 243: 206-210, 1989; Luger, et al., Prot. Eng., 3: 249-258, 1990; Lin, et al, 

4 Protein Science, 4: 159-166, 1995; Vignais, et al, Protein Science, 4: 994-1000, 1995; 

5 Ritco-Vonsovici, et al. Biochemistry, 34: 16543-16551, 1995; Horlick, et al. Protein 

6 Eng., 5: 427-431, 1992; Kreitman, et al. Cytokine. 7: 311-318, 1995; Viguera, et al, 

7 Mol Biol, 247: 670-681, 1995; Koebnik and Kramer, J. Mol Biol, 250: 617-626, 1995; 



8 Kreitman, et al, Proc. Natl Acad Sci., 91: 6889-6893, 1994). 

9 There exists a need for the development of plant expressible insecticidal proteins 

10 which possess minimal or no allergenic properties. 

1 1 SUMMARY OF THE INVENTION 

12 Novel protein sequences, and nucleic acid sequences encoding them are disclosed. 

13 The proteins maintain desirable enzymatic and insecticidal properties while displaying 

14 reduced or eliminated allergenicity. 

15 Allergenic epitopes are identified by scanning overlapping peptide sequences with 

16 an immunoreactivity assay. Alanine scaiming and 'rational substitution' is performed on 

17 identified peptide sequences to determine specific amino acids which contribute to 

18 antibody binding, and presumably, to the allergenic properties of the whole protein. 

19 Individual mutations are introduced into the whole protein sequence by methods such as 

20 site directed mutagenesis of the encoding nucleic acid sequence to delete or modify the 

21 allergenic sequences. 

22 Glycosylation target residues are identified within amino acid sequences of 

23 proteins which have demonstrated allergy eliciting properties. Glycosylation target 

24 amino acid residues are rationally substutited with other amino acid residues to eliminate 

25 glycosylation and to provide a variant deglycosylated protein. The variant protein may 

26 then exhibit reduced allergen eliciting properties and may also exhibit reduced binding to 

27 IgE within serum of patients observed to be allergic to said glycosylated protein. 

28 Permuteins of the deallergenized protein sequences can be constructed to further 

29 reduce or eliminate allergic reactions. The encoding nucleic acid sequence is modified to 
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produce a non-naturally occurring protein having a linear amino acid sequence different 
from the naturally occurring protein sequence, while maintaining enzymatic and 
insecticidal properties. The permutein is preferably produced in plant cells, and more 
preferably produced at a concentration which is toxic to insects ingesting the plant cells. 

Methods for reducing, eliminating, or decreasing allergen eliciting properties of a 
protein are specifically contemplated herein. Such methods comprise steps including 
identifying one or more patients exhibiting an allergic sensitivity to an allergen eliciting 
protein and obtaining a sample of serum from the patient; exposing the patient serum to a 
first set of synthetic overlapping peptides which represent the allergen eliciting protein in 
order to identify such peptides which exhibit epitopes which bind to IgE present within 
the allergic patients' serum and wherein the IgE present in the serum has a specific 
affinity for the said allergen eliciting protein; producing a second set of peptides which 
are variant peptides based on the first set of peptides which were identified to bind 
specifically to IgE present in patient serum, wherein the second set variant peptides 
exhibit alanine scanning or rational scanning amino acid substitutions which exhibit 
reduced, decreased, or eliminated IgE binding when compared to the first set non-variant 
peptides, and wherein such substitutions which reduce, eliminate or decrease IgE binding 
are identified as result effective substitutions; and modifying the amino acid sequence of 
the allergen eliciting protein to contain one or more of said result effective substitutions, 
wherein the modified protein is a variant of the allergen eliciting protein which lacks 
allergen eliciting protein or exhibits reduced allergen eliciting properties, and wherein the 
variant of the allergen eliciting protein comprising one or more result effective 
substitutions exhibits reduced, decreased, or totally eliminated binding of IgE present 
within said patients' serum. 

The novel proteins can be used in controlling insects, as nutritional supplements, 
in immunotherapy protocols, and in other potential applications. Transgenic plant cells 
and plants containing the encoding nucleic acid sequence can be particularly beneficial in 
the control of insects, and as a nutritional/immunotherapy material. 
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DESCRIPTION OF THE FIGURES 



The following figures form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention can be better 
understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

Figure 1 illustrates the alignment of potato patatin PatA (acyl lipid hydrolase) 
with patatin (acyl lipid hydrolase) homologs and related amino acid sequences, the 
homologs and related sequences being from both dicot and monocot plant species. 

Figure 2 illustrates IgE binding to overlapping peptide sequences. 

Figure 3 illustrates construction of nucleic acid sequences encoding patatin 
permutein proteins, and in this figure for illustrative purposes a breakpoint at position 247 
is shown. 

DESCRIPTION OF THE SEQUENCE LISTINGS 

The following description of the sequence listing forms part of the present 
specification and is included to further demonstrate certain aspects of the present 
invention. The invention can be better understood by reference to one or more of these 
sequences in combination with the detailed description of specific embodiments 
presented herein. 



SEQ IDN0:1 


DNA sequence encoding a patatin (acyl lipid hydrolase) 




protein 


SEQ ID N0:2 


potato patatin protein sequence 


SEQ ID N0:3 


thermal amplification primer 


SEQ ID N0:4 


thermal amplification primer 


SEQ ID NO: 5 


thermal amplification product 


SEQ ID N0:6 


Pre-cleavage patatin protein produced in Pichia pastoris 


SEQ ID N0:7 


Post-cleavage patatin protein produced in Pichia pastoris 


SEQ ID N0:8 


Y106F mutagenic primer 


SEQ ID N0:9 


Y129F mutagenic primer 


SEQ ID NO: 10 


Y185F mutagenic primer 
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SEQ ID NO: 11 


Y193F mutagenic primer 


SEQ ID NO: 12 


Y185F and Y193F mutagenic primer 


SEQ ID NO: 13 


Y270F mutagenic primer 


cpO TD NO- 14 




SEQ ID NO: 15 


Y362F mutagenic primer 


SEQ ID NO: 16- 104 


Peptide scan sequences of a patatin protein 


SEQ ID NO: 105-241 


Alanine and rational scan sequences of selected patatin 




peptides 


SEQ ID NO:242 


thermal amplification primer 27 


SEQ IDNO:243 


thermal amplification primer 48 


SEQ ID NO:244 


thermal amplification primer 47 


SEQIDNO:245 


thermal amplification primer 36 


SEQ ID NO:246 


pMON37402 sequence encoding permutein protein 


SEQ IDNO:247 


Permutein protein encoded from pMON37402 sequence 


SEQ IDNO:248 


thermal amplification primer 58 


SEQIDNO:249 


thermal amplification primer 59 


SEQ IDNO:250 


pMON37405 sequence encoding permutein protein 


SEQ ID NO:251 


Permutein protein encoded by pMON37405 sequence 


SEQ ID NO:252 


thermal amplification primer 60 


SEQ IDNO:253 


thermal amplification primer 61 


SEQIDNO:254 


pMON37406 sequence encoding permutein protein 


SEQIDNO:255 


Permutein protein encoded by pMON37406 sequence 


SEQ ID NO:256 


thermal amplification primer 62 


SEQ ID NO:257 


thermal amplification primer 63 


SEQ IDNO:258 


pMON37407 sequence encoding permutein protein 


SEQ IDNO:259 


Permutein protein encoded by pMON37407 sequence 


SEQIDNO:260 


thermal amplification primer 60 


SEQ ID NO:261 


thermal amplification primer65 


SEQ ID NO:262 


pMON37408 sequence encoding permutein protein 


CPO m "MHO/;'? 


Permutein protein encoded by pMON37408 sequence 


SEQ IDN0.264 


pMON40701 sequence encoding permutein protein 
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SEQ ID NO:265 


Permutein protein encoded by pMON40701 sequence 


SEQ ID NO:266 


thermal amplification primer Synl 


SEQ ID NO:267 


thermal amplification primer Syn2 


SEQ ID NO:268 


thermal amplification primer Syn3 


SEQ ID NO:269 


thermal amplification primer Syn4 


SEQ IDNO:270 


pMON40703 sequence encoding permutein protein 


SEQIDNO:271 


Permutein protein encoded by pMON40703 sequence 


SEQ ID NO:272 


thermal amplification primer SynlO 


SEQ ID NO:273 


thermal amplification primer Synl 1 


SEQ IDNO:274 


pMON40705 sequence encoding permutein protein 


SEQ IDNO:275 


Permutein protein encoded by pMON40705 sequence 


SEQ ID NO:276-277 


Permutein linker sequences 


SEQ IDNO:278 


Patatin isozyme PatA+ (including signal peptide) 


SEQIDNO:279 


Patatin isozyme PatB+ (including signal peptide) 


SEQ ID NO:280 


Patatin isozyme PatFm (mature protein lacking signal peptide) 


SEQ ID NO:281 


Patatin isozyme Patim (mature protein lacking signal peptide) 


SEQ ID NO:282 


Patatin isozyme PatL+ (including signal peptide) 


SEQ ID NO:283 


Rational substitution peptide 


SEQ ID NO:284 


Com homolog peptide 


SEQIDNO:285 


patatin homolog Pat 17 DNA coding sequence and amino acid 




translation 


SEQ ID NO:286 


patatin homolog Pat 17 amino acid sequence 


SEQ ID NO:287 


dicot patatin homolog amino acid sequence pentinl_phb 


SEQ IDNO:288 


dicot patatin homolog amino acid sequence 5c9_phb 


SEQ ID NO:289 


maize patatin homolog amino acid sequence coml_pep 


SEQ ID NO:290 


maize patatin homolog amino acid sequence com2_pep 


SEQ IDNO:291 


maize patatin homolog amino acid sequence com3__pep 


SEQ ID NO:292 


maize patatin homolog amino acid sequence com4_pep 


SEQ ID NO:293 


maize patatin homolog amino acid sequence com5 j5ep 
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DEFINITIONS 



The following definitions are provided in order to aid those skilled in the art in 
understanding the detailed description of the present invention. Some words and phrases 
may also be defined in other sections of the specification. No limitation should be placed 
on the definitions presented for the terms below, where other meanings are evidenced 
elseqhere in the specification in addition to those specified below. 

"Allergen" refers to a biological or chemical substance that induces an allergic 
reaction or response. An allergic response can be an immunoglobulin E-mediated 
response. 

Amino acid codes: A (Ala) = alanine; C (Cys) = cysteine; D (Asp) = aspartic acid; 
E (Glu) = glutamic acid; F (Phe) = phenylalanine; G (Gly) = glycine; H (His) = histidine; 
I (He) = isoleucine; K (Lys) = lysine; L (Leu) = leucine; M (Met)= methionine; N (Asn) = 
asparagine; P (Pro) = proline; Q (Gin) = glutamine; R (Arg) = arginine; S (Ser) = serine; 
T (Thr) = threonine; V = (Val) valine; W (Trp) = tryptophan; Y (Tyr) = tyrosine. 

"Amplification: refers to increasing the number of copies of a desired molecule. 

"Coding sequence", "open reading frame", and "structural sequence" refer to the 
region of continuous sequential nucleic acid base pair triplets encoding a protein, 
polypeptide, or peptide sequence. 

"Codon" refers to a sequence of three nucleotides that specify a particular amino 

acid. 

"Complementarity" refers to the specific binding of adenine to thymine (or uracil 
in RNA) and cytosine to guanine on opposite strands of DNA or RNA. 

"Deallergenize" (render hypoallergenic) refers to the method of engineering or 
modifying a protein or the encoding DNA such that the protein has a reduced or 
eliminated ability to induce an allergic response with respect to the ability of the 
unmodified protein. A deallergenized protein can be referred to as being hypoallergenic. 
The degree of deallergenization of a protein can be measured in vitro by the reduced 
binding of IgE antibodies. 

"DNA segment heterologous to the promoter region" means that the coding DNA 
segment does not exist in nature in the same gene with the promoter to which it is now 
attached. 
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"DNA segment" refers to a DNA molecule that has been isolated free of total 
genomic DNA of a particular species. 

"Electroporation" refers to a method of introducing foreign DNA into cells that 
uses a brief, high voltage DC (direct current) charge to permeabilize the host cells, 
causing them to take up extra-chromosomal DNA. 

"Encoding DNA" refers to chromosomal DNA, plasmid DNA, cDNA, or 
synthetic DNA which encodes any of the enzymes discussed herein. 

"Endogenous" refers to materials originating from within an organism or cell. 

"Endonuclease" refers to an enzyme that hydrolyzes double stranded DNA at 
internal locations. 

"Epitope" refers to a region on an allergen that interacts with the cells of the 
irrmiune system. Epitopes are often fiirther defined by the type of antibody or cell with 
which they interact, e.g. if the region reacts with B-cells or antibodies (IgE), it is called a 
B-cell epitope. 

"Exogenous" refers to materials originating from outside of an organism or cell. 
This typically applies to nucleic acid molecules used in producing transformed or 
transgenic host cells and plants. 

"Expressibly coupled" and "expressibly linked" refer to a promoter or promoter 
region and a coding or structural sequence in such an orientation and distance that 
transcription of the coding or structural sequence can be directed by the promoter or 
promoter region. 

"Expression" refers to the transcription of a gene to produce the corresponding 
mRNA and translation of this mRNA to produce the corresponding gene product, i.e., a 
peptide, polypeptide, or protein. 

"Heterologous DNA" refers to DNA from a source different than that of the 
recipient cell. 

"Homologous DNA" refers to DNA from the same source as that of the recipient 

cell. 

"Identity" refers to the degree of similarity between two nucleic acid or protein 
sequences. An alignment of the two sequences is performed by a suitable computer 
program. A widely used and accepted computer program for performing sequence 
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alignments is CLUSTALW vl.6 (Thompson, et al. Nucl. Acids Res., 22: 4673-4680, 
1994). The number of matching bases or amino acids is divided by the total number of 
bases or amino acids, and multiplied by 100 to obtain a percent identity. For example, if 
two 580 base pair sequences had 145 matched bases, they would be 25 percent identical. 
If the two compared sequences are of different lengths, the number of matches is divided 
by the shorter of the two lengths. For example, if there were 100 matched amino acids 
between 200 and a 400 amino acid proteins, they are 50 percent identical with respect to 
the shorter sequence. If the shorter sequence is less than 150 bases or 50 amino acids in 
length, the number of matches are divided by 150 (for nucleic acid bases) or 50 (for 
amino acids), and multiplied by 100 to obtain a percent identity. 

"IgE" (Immunoglobulin E) refers to a specific class of immunoglobulin secreted 
by B cells. IgE binds to specific receptors on Mast cells. Interaction of an allergen with 
mast cell-bound IgE may trigger allergic symptoms. 

"Immunotherapy" refers to any type of treatment that targets the immune system. 
Allergy immunotherapy is a treatment in which a progressively increasing dose of an 
allergen is given in order to induce an immune response characterized by tolerance to the 
antigen/allergen, also known as desensitization. 

"In vitro" refers to "in the laboratory" and/or "outside of a living organism". 

"In vivo" refers to "in a living organism". 

"Insecticidal polypeptide" refers to a polypeptide having insecticidal properties 
that adversely affects the growth and development of insect pests. 

"Monocot" refers to plants having a single cotyledon (the first leaf of the embryo 
of seed plants); examples include cereals such as maize, rice, wheat, oats, and barley. 

"Multiple cloning site" refers to an artificially constructed collection of restriction 
enzyme sites in a vector that facilitates insertion of foreign DNA into the vector. 

"Mutation" refers to any change or alteration in a nucleic acid sequence. Several 
types exist, including point, fi-ame shift, splicing, and insertion/deletions. 

"Native" refers to "naturally occurring in the same organism". For example, a 
native promoter is the promoter naturally found operatively linked to a given coding 
sequence in an organism. A native protein is one naturally found in nature and untouched 
or not otherwise manipulated by the hand of man. 
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"Nucleic acid segment" is a nucleic acid molecule that has been isolated free of 
total genomic DNA of a particular species, or that has been synthesized. Included with 
the term "nucleic acid segment" are DNA segments, recombinant vectors, plasmids, 
cosmids, phagemids, phage, viruses, etcetera. 

"Nucleic acid" refers to deoxyribonucleic acid (DNA) and ribonucleic acid 

(RNA). 

Nucleic acid codes: A = adenosine; C = cytosine; G = guanosine; T = thymidine; 
N = equimolar A, C, G, and T; 1 = deoxyinosine; K = equimolar G and T; R = equimolar 
A and G; S = equimolar C and G; W = equimolar A and T; Y = equimolar C and T. 

"Open reading frame (ORF)" refers to a region of DNA or RNA encoding a 
peptide, polypeptide, or protein or capable of being translated to protein, or a regioiu of 
DNA capable of being transcribed into an RNA product. 

"Plasmid" refers to a circular, extrachromosomal, self-replicating piece of DNA. 

"Point mutation" refers to an alteration of a single nucleotide in a nucleic acid 
sequence. 

"Polymerase chain reaction (PGR)" refers to an enzymatic technique to create 
multiple copies of one sequence of nucleic acid. Copies of DNA sequence are prepared 
by shuttling a DNA polymerase between two oligonucleotides. The basis of this 
amplification method is multiple cycles of temperature changes to denature, then re- 
anneal amplimers, followed by extension to synthesize new DNA strands in the region 
located between the flanking amplimers. Also known as thermal amplification. 

"Probe" refers to a polynucleotide sequence which is complementary to a target 
polynucleotide sequence in the analyte. An antibody can also be used as a probe to detect 
the presence of an antigen. In that sense, the antigen binding domain of the antibody has 
some detectable affinity for the antigen and binds thereto. The binding of the antibody to 
the antigen can be measured by means known in the art, such as by chemiluminescence, 
phosphorescence, flourescence, colorimetric chemical deposition at the site of binding, or 
otherwise. 

"Promoter" or "promoter region" refers to a DNA sequence, usually found 
upstream (5') to a coding sequence, that controls expression of the coding sequence by 
controlling production of messenger RNA (mRNA) by providing the recognition site for 



H.4I94IS(8ZMFOI!.DOC) 



-15- of -2 13- 



RNA polymerase and/or other factors necessary for start of transcription at the correct 
site. As contemplated herein, a promoter or promoter region includes variations of 
promoters derived by means of ligation to various regulatory sequences, random or 
controlled mutagenesis, and addition or duplication of enhancer sequences. The 
promoter region disclosed herein, and biologically functional equivalents thereof, are 
responsible for driving the transcription of coding sequences under their control when 
introduced into a host as part of a suitable recombinant vector, as demonstrated by its 
ability to produce mRNA. 

"Recombinant DNA construct" or "recombinant vector" refers to any agent such 
as a plasmid, cosmid, virus, autonomously replicating sequence, phage, or linear or 
circular single-stranded or double-stranded DNA or RNA nucleotide sequence, derived 
from any source, capable of genomic integration or autonomous replication, comprising a 
DNA molecule in which one or more DNA sequences have been linked in a functionally 
operative manner. Such recombinant DNA constructs or vectors are capable of 
introducing a 5' regulatory sequence or promoter region and a DNA sequence for a 
selected gene product into a cell in such a manner that the DNA sequence is transcribed 
into a ftmctional mRNA which is translated and therefore expressed. Recombinant DNA 
constructs or recombinant vectors can be constructed to be capable of expressing 
antisense RNAs, in order to inhibit translation of a specific RNA of interest. 

"Recombinant proteins", also referred to as "heterologous proteins", are proteins 
which are normally not produced by the host cell. 

"Regeneration" refers to the process of growing a plant from a plant cell (e.g., 
plant protoplast or explant). 

"Regeneration" refers to the process of growing a plant from a plant cell {e.g., 
plant protoplast or explant). 

"Regulatory sequence" refers to a nucleotide sequence located upstream (5'), 
within, and/or downstream (3') to a DNA sequence encoding a selected gene product 
whose transcription and expression is controlled by the regulatory sequence in 
conjunction with the protein synthetic apparatus of the cell. 
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"Restriction enzyme" refers to an enzyme that recognizes a specific palindromic 
sequence of nucleotides in double stranded DNA and cleaves both strands; also called a 
restriction endonuclease. Cleavage typically occurs within the restriction site. 

"Result-effective substitution" (RES) refers to an amino acid substitution within 
an IgE-binding region (epitope) of a target protein which reduces or eliminates the IgE 
binding by that epitope. Examples herein are directed to patatin protein and homologues, 
however, as will be readily recognized by those skilled in the art, the method is more 
broadly applicable to proteins other than patatins, and in particular is applicable to any 
protein exhibiting allergen eliciting properties. 

"Selectable marker" refers to a nucleic acid sequence whose expression confers a 
phenotype facilitating identification of cells containing the nucleic acid sequence. 
Selectable markers include those which confer resistance to toxic chemicals (e.g. 
ampicillin resistance, kanamycin resistance), complement a nutritional deficiency (e.g. 
uracil, histidine, leucine), or impart a visually distinguishing characteristic (e.g. color 
changes or fluorescence). 

"Transcription" refers to the process of producing an RNA copy from a DNA 
template. 

"Transformation" refers to a process of introducing an exogenous nucleic acid 
sequence (e.g., a vector, recombinant nucleic acid molecule) into a cell or protoplast in 
which that exogenous nucleic acid is incorporated into a chromosome or is capable of 
autonomous replication. 

"Transformed cell" is a cell whose DNA has been altered by the introduction of 
an exogenous nucleic acid molecule into that cell. 

"Transgenic cell" refers to any cell derived from or regenerated from a 
transformed cell or derived from a transgenic cell. Exemplary transgenic cells include 
plant calli derived from a transformed plant cell and particular cells such as leaf, root, 
stem, e.g., somatic cells, or reproductive (germ) cells obtained from a transgenic plant. 

"Transgenic plant" refers to a plant or progeny thereof derived from a transformed 
plant cell or protoplast, wherein the plant DNA contains an infroduced exogenous nucleic 
acid sequence not originally present in a native, non-transgenic plant of the same species. 
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Alternatively, the plant DNA can contain the introduced nucleic acid sequence in a higher 
copy number than in the native, non-transgenic plant of the same species. 

"Translation" refers to the production of protein from messenger RNA. 

"Vector" refers to a plasmid, cosmid, bacteriophage, or virus that carries foreign 
DNA into a host organism. 

"Western blot" refers to protein or proteins that have been separated by 
electrophoresis, transferred and immobilized onto a solid support, then probed with an 
antibody. 

DETAILED DESCRIPTION OF THE INVENTION 

Design of deallereenized patatin proteins 

Deallergenizing a protein can be accomplished by the identification of allergenic 
sites, followed by modification of the sites to reduce or eliminate the binding of 
antibodies to the sites. The IgE-binding regions of patatin were previously unreported. 
Mapping of the IgE epitopes was accomplished by synthesizing 10-mer peptides based on 
the patatin 17 protein sequence (SEQ ID NO: 2) which overlap by six amino acids. As 
potato proteins are denatured upon cooking potato products, it is expected that the 10-mer 
peptides sufficiently mimic the unfolded full length protein for antibody binding 
purposes. Peptides were identified based upon their ability to bind to IgE antibodies. 
Individual amino acids within the identified peptides were changed to reduce or eliminate 
binding to IgE present in sera from potato sensitive patients. These changes are termed 
result-effective amino acid substitutions (RES). The RES can be subsequently introduced 
into the full length protein by site directed mutagenesis of the encoding nucleic acid 
sequence or other means known in the art. Similar strategies have been employed 
elsewhere to determine the dominant IgE epitopes in a major peanut allergen (Stanley, 
J.S., etal.,Arch. Biochem. Biophys., 342(2): 244-253, 1997). 

Certain amino acid residues important for allergenicity of patatin are identified. 
Some of the designed patatin peptides wherein single amino acid residues were replaced 
with alanine or phenylalanine, showed significantly reduced or no binding to sera from 
potato sensitive patients. 
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1 A "deallergenized patatin" refers to a patatin protein differing in at least one of 

2 the amino acid residues as defined by the result effective substitutions resulting in the 

3 patatin protein having reduced reactivity towards sera from potato sensitive patients. The 

4 deallergenized patatin preferably maintains insecticidal properties, and preferably 

5 maintains its characteristic enzymatic profile. 

6 Summary of method to deallergenize a patatin protein 

7 ■ Mapping of IgE epitopes by immunoassay of synthetic overlapping peptides 

g using sera from potato sensitive patients; 

9 ■ Identification of result-effective substitutions by alanine scanning and/or 

10 rational scanning; 

11 ■ Modification of the amino acid sequence of patatin by site-directed 

12 mutagenesis of the encoding nucleic acid sequence; 

13 ■ Evaluation of enzyme activity (esterase) and/or insecticidal activity of the 

14 modified protein(s); and 

15 ■ Evaluation of the new protein(s) for allergenicity by IgE immunoassay. 

16 

17 Nucleic acid sequences encoding patatin have been cloned by several 

18 investigators (e.g. Mignery, et al. Nucleic Acids Research, 12: 7987-8000, 1984; 

19 Mignery, et al. Gene, 62: 27-44, 1988; WO 94/21805; Canadian Patent Application No. 

20 2090552). These nucleic acid sequences can then be manipulated using site directed 

21 mutagenesis to encode a hypoallergenic patatin. These nucleic acid sequences can than 

22 be used to transform bacterial, yeast or plant cells, resulting in the production of 

23 hypoallergenic patatin protein. 

24 Deallergenized patatin proteins 

25 For simplicity, individual amino acids are referred to by their single letter codes. 

26 Correlation between the single letter codes, three letter codes, and fiiU amino acid names 

27 is presented in the definitions section above. 

28 One embodiment of the invention is an isolated deallergenized patatin protein. 

29 The protein is modified relative to the wild-type protein sequence such that they exhibit 
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reduced binding to anti-patatin antibodies such as those obtained from humans or animals 
allergic to potatoes. The reduced binding is measured relative to the binding of the 
unmodified patatin protein to the anti-patatin antibodies. 

The deallergenized patatin protein can comprise SEQ ID N0:2 modified in one or 
more of the following regions, or SEQ ID N0:7 modified in one or more of the following 
regions. The single or multiple amino acid modifications reduce the binding of the 
modified protein relative to the binding of the corresponding unmodified protein. The 
regions for modification include amino acid positions 104-113 of SEQ ID N0:2 (85-94 
of SEQ ID N0:7), 128-137 of SEQ ID N0:2 (109-118 of SEQ ID N0:7), 184-197 of 
SEQ ID N0:2 (165-178 of SEQ ID N0:7), 264-277 of SEQ ID N0:2 (245-258 of SEQ 
ID N0:7), 316-325 of SEQ ID N0:2 (297-306 of SEQ ID N0:7), and 360-377 of SEQ ID 
N0:2 (341-358 of SEQ ID NO:7). The possible amino acid modifications include 
replacing an amino acid with A, E, F, P, or S. The modifications replace one or more 
amino acids in the identified regions, without increasing or decreasing the total number of 
amino acids in the protein. 

Preferably, the deallergenized patatin protein comprises SEQ ID N0:2 modified 
by one or more changes, or SEQ ID N0:7 modified by one or more changes. SEQ ID 
N0:7 differs from wild type SEQ ID N0:2 in that the first 22 amino acids of SEQ ID 
N0:2 are replaced with EAE (Glu-Ala-Glu). For example, the changes to SEQ ID N0:2 
or SEQ ID N0:7 can be: the Y corresponding to position 106 of SEQ ID N0:2 or 
position 87 of SEQ ID NO: 7 is replaced with F or A; the I corresponding to position 1 13 
of SEQ ID N0:2 or position 94 of SEQ ID N0:7 is replaced with A; the Y corresponding 
to position 129 of SEQ ID N0:2 or position 1 10 of SEQ ID N0:7 is replaced with F or 
A; the K corresponding to position 137 of SEQ ID N0:2 or position 1 1 8 of SEQ ID N0:7 
is replaced witii A; the S corresponding to position 184 of SEQ ID N0:2 or position 165 
of SEQ ID N0:7 is replaced with A; the Y corresponding to position 185 of SEQ ID 
N0:2 or position 166 of SEQ ID N0:7 is replaced with F or A; the A corresponding to 
position 188 of SEQ ID N0:2 or position 169 of SEQ ID N0:7 is replaced with S; the T 
corresponding to position 192 of SEQ ID N0:2 or position 173 of SEQ ID N0:7 is 
replaced with A or P; the Y corresponding to position 193 of SEQ ID N0:2 or position 
174 of SEQ ID N0:7 is replaced with F or A; the K corresponding to position 268 of 
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SEQ ID N0:2 or position 249 of SEQ ID N0:7 is replaced with A or E; the T 
corresponding to position 269 of SEQ ID N0:2 or position 250 of SEQ ID N0:7 is 
replaced with A; the Y corresponding to position 270 of SEQ ID N0:2 or position 251 of 
SEQ ID N0:7 is replaced with F or A; the K corresponding to position 273 of SEQ ID 
NO:2 or position 254 of SEQ ID N0:7 is replaced with A; the K corresponding to 
position 313 of SEQ ID N0:2 or position 294 of SEQ ID N0:7 is replaced with E; the N 
corresponding to position 314 of SEQ ID N0:2 or position 295 of SEQ ID N0:7 is 
replaced with A; the N corresponding to position 315 of SEQ ID N0:2 or position 296 of 
SEQ ID N0:7 is replaced with A; the Y corresponding to position 316 of SEQ ID N0:2 
or position 297 of SEQ ID NO: 7 is replaced with F or A; the Y corresponding to position 
362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F; the K 
corresponding to position 367 of SEQ ID N0:2 or position 348 of SEQ ID N0:7 is 
replaced with A; the R corresponding to position 368 of SEQ ID N0:2 or position 349 of 
SEQ ID N0:7 is replaced with A; the F corresponding to position 369 of SEQ ID NO:2 
or position 350 of SEQ ID N0:7 is replaced with A; the K corresponding to position 371 
of SEQ ID N0:2 or position 352 of SEQ ID N0:7 is replaced with A; the L 
corresponding to position 372 of SEQ ID N0:2 or position 353 of SEQ ID N0:7 is 
replaced with A; and the L corresponding to position 373 of SEQ ID N0:2 or position 
354 of SEQ ID N0:7 is replaced with A. 

More preferably, SEQ ID N0:2 is modified by the following changes or SEQ ID 
N0:7 is modified by the following changes: the Y corresponding to position 106 of SEQ 
ID NO:2 or position 87 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 129 of SEQ ID N0:2 or position 1 10 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 193 of SEQ ID N0:2 or position 174 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 
or position 251 of SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 
of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is replaced with F; and the Y 
corresponding to position 362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is 
replaced with F. 
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Most preferably, SEQ ID N0;2 is modified by the following changes or SEQ ID 
N0:7 is modified by the following changes: the Y corresponding to position 185 of SEQ 
ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 or position 297 of 
SEQ ID N0:7 is replaced with F; and the Y corresponding to position 362 of SEQ ID 
N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Nucleic acids 

An additional embodiment of the invention is an isolated nucleic acid molecule 
segment comprising a structural nucleic acid sequence which encodes a deallergenized 
patatin protein. 

The structural nucleic acid sequence can generally encode any deallergenized 
patatin protein. The structural nucleic acid sequence preferably encodes a deallergenized 
patatin protein comprising SEQ ID N0:2 modified in one or more of the following 
regions, or SEQ ID N0:7 modified in one or more of the following regions. The single 
or multiple amino acid modifications reduce the binding of the modified protein relative 
to the binding of the corresponding unmodified protein. The regions for modification 
include amino acid positions 104-1 13 of SEQ ID N0:2 (85-94 of SEQ ID N0:7), 128- 
137 of SEQ ID N0:2 (109-1 18 of SEQ ID N0:7), 184-197 of SEQ ID N0:2 (165-178 of 
SEQ ID N0:7), 264-277 of SEQ ID N0:2 (245-258 of SEQ ID N0:7), 316-325 of SEQ 
ID N0:2 (297-306 of SEQ ID N0:7), and 360-377 of SEQ ID N0:2 (341-358 of SEQ ID 
N0:7). The possible amino acid modifications include replacing an amino acid with A, 
E, F, P, or S. The modifications replace one or more amino acids in the identified 
regions, without increasing or decreasing the total number of amino acids in the protein. 

Alternatively, the structural nucleic acid sequence encodes SEQ ID N0:2 
modified by one or more of the following changes or encoding SEQ ID N0;7 modified 
by one or more of the following changes: the Y corresponding to position 106 of SEQ ID 
N0:2 or position 87 of SEQ ID N0:7 is replaced with F or A; the I corresponding to 
position 1 13 of SEQ ID N0:2 or position 94 of SEQ ID N0:7 is replaced with A; the Y 
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1 corresponding to position 129 of SEQ ID N0:2 or position 110 of SEQ ID N0:7 is 

2 replaced with F or A; the K corresponding to position 137 of SEQ ID N0:2 or position 

3 1 1 8 of SEQ ID N0:7 is replaced with A; the S corresponding to position 1 84 of SEQ ID 

4 NO: 2 or position 165 of SEQ ID NO: 7 is replaced with A; the Y corresponding to 

5 position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F or A; 

6 the A corresponding to position 1 88 of SEQ ID N0:2 or position 1 69 of SEQ ID N0:7 is 

7 replaced with S; the T corresponding to position 192 of SEQ ID N0:2 or position 173 of 

8 SEQ ID N0:7 is replaced with A or P; the Y corresponding to position 193 of SEQ ID 

9 N0:2 or position 174 of SEQ ID N0:7 is replaced with F or A; the K corresponding to 

10 position 268 of SEQ ID N0:2 or position 249 of SEQ ID N0:7 is replaced with A or E; 

11 the T corresponding to position 269 of SEQ ID N0:2 or position 250 of SEQ ID N0:7 is 

12 replaced with A; the Y corresponding to position 270 of SEQ ID N0:2 or position 25 1 of 

13 SEQ ID N0:7 is replaced with F or A; the K corresponding to position 273 of SEQ ID 

14 N0:2 or position 254 of SEQ ID N0:7 is replaced with A; the K corresponding to 

1 5 position 3 1 3 of SEQ ID N0:2 or position 294 of SEQ ID N0:7 is replaced with E; the N 

16 corresponding to position 314 of SEQ ID N0:2 or position 295 of SEQ ID N0:7 is 

17 replaced with A; the N corresponding to position 3 1 5 of SEQ ID N0:2 or position 296 of 

18 SEQ ID N0:7 is replaced with A; the Y corresponding to position 316 of SEQ ID N0:2 

19 or position 297 of SEQ ID N0:7 is replaced with F or A; the Y corresponding to position 

20 362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F; the K 

21 corresponding to position 367 of SEQ ID N0:2 or position 348 of SEQ ID N0:7 is 

22 replaced with A; the R corresponding to position 368 of SEQ ID N0:2 or position 349 of 

23 SEQ ID N0:7 is replaced with A; the F corresponding to position 369 of SEQ ID N0:2 

24 or position 350 of SEQ ID N0:7 is replaced with A; the K corresponding to position 371 

25 of SEQ ID N0:2 or position 352 of SEQ ID N0:7 is replaced with A; the L 

26 corresponding to position 372 of SEQ ID N0:2 or position 353 of SEQ ID N0:7 is 

27 replaced with A; and the L corresponding to position 373 of SEQ ID N0:2 or position 

28 354 of SEQ ID N0:7 is replaced with A. 

29 More preferably, the structural nucleic acid sequence encodes SEQ ID N0:2 

30 modified by the following changes or SEQ ID N0:7 modified by the following changes: 

31 the Y corresponding to position 106 of SEQ ID N0:2 or position 87 of SEQ ID N0:7 is 
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replaced with F; the Y corresponding to position 129 of SEQ ID N0:2 or position 1 10 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 185 of SEQ ID N0:2 
or position 166 of SEQ ID NO: 7 is replaced with F; the Y corresponding to position 193 
of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 or position 297 of 
SEQ ID N0:7 is replaced with F; and the Y corresponding to position 362 of SEQ ID 
N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Most preferably, the structural nucleic acid sequence encodes SEQ ID N0:2 
modified by the following changes or SEQ ID N0:7 modified by the following changes: 
the Y corresponding to position 1 85 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 193 of SEQ ID N0:2 or position 174 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 
or position 251 of SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 
of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is replaced with F; and the Y 
corresponding to position 362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is 
replaced with F. 

Recombinant vectors 

An additional embodiment is directed towards recombinant vectors comprising a 
structural nucleic acid sequence which encodes a deallergenized patatin protein. The 
recombinant vector comprises operatively linked in the 5' to 3' orientation: a promoter 
that directs transcription of a structural nucleic acid sequence; a structural nucleic acid 
sequence, and a 3' transcription terminator. 

The structural nucleic acid sequence can encode SEQ ID N0:2 modified in one or 
more of the following regions, or SEQ ID NO: 7 modified in one or more of the following 
regions. The single or multiple amino acid modifications reduce the binding of the 
modified protein relative to the binding of the corresponding unmodified protein. The 
regions for modification include amino acid positions 104-113 of SEQ ID N0:2 (85-94 
of SEQ ID N0:7), 128-137 of SEQ ID N0:2 (109-118 of SEQ ID N0:7), 184-197 of 
SEQ ID N0:2 (165-178 of SEQ ID N0:7), 264-277 of SEQ ID N0:2 (245-258 of SEQ 
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ID N0:7), 316-325 of SEQ ID N0:2 (297-306 of SEQ ID N0:7), and 360-377 of SEQ ID 
N0:2 (341-358 of SEQ ID N0:7). The possible amino acid modifications include 
replacing an amino acid with A, E, F, P, or S. The modifications replace one or more 
amino acids in the identified regions, without increasing or decreasing the total number of 
amino acids in the protein. 

Altematively, the recombinant vector comprises operatively linked in the 5' to 3' 
orientation: a promoter that directs transcription of a structural nucleic acid sequence; a 
structural nucleic acid sequence encoding SEQ ID N0:2 modified by one or more of the 
following changes or encoding SEQ ID N0:7 modified by one or more of the following 
changes: the Y corresponding to position 106 of SEQ ID N0:2 or position 87 of SEQ ID 
N0:7 is replaced with F or A; the I corresponding to position 113 of SEQ ID N0:2 or 
position 94 of SEQ ID N0:7 is replaced with A; the Y corresponding to position 129 of 
SEQ ID N0:2 or position 110 of SEQ ID N0:7 is replaced with F or A; the K 
corresponding to position 137 of SEQ ID N0:2 or position 118 of SEQ ID N0:7 is 
replaced with A; the S corresponding to position 184 of SEQ ID N0:2 or position 165 of 
SEQ ID N0:7 is replaced with A; the Y corresponding to position 185 of SEQ ID N0:2 
or position 166 of SEQ ID N0:7 is replaced with F or A; the A corresponding to position 
188 of SEQ ID N0:2 or position 169 of SEQ ID N0:7 is replaced with S; the T 
corresponding to position 192 of SEQ ID N0:2 or position 173 of SEQ ID N0:7 is 
replaced with A or P; the Y corresponding to position 193 of SEQ ID N0:2 or position 
174 of SEQ ID N0:7 is replaced with F or A; the K corresponding to position 268 of 
SEQ ID N0:2 or position 249 of SEQ ID N0:7 is replaced with A or E; the T 
corresponding to position 269 of SEQ ID N0:2 or position 250 of SEQ ID N0:7 is 
replaced with A; the Y corresponding to position 270 of SEQ ID N0;2 or position 25 1 of 
SEQ ID N0:7 is replaced with F or A; the K corresponding to position 273 of SEQ ID 
N0:2 or position 254 of SEQ ID N0:7 is replaced with A; the K corresponding to 
position 313 of SEQ ID N0:2 or position 294 of SEQ ID N0:7 is replaced with E; the N 
corresponding to position 314 of SEQ ID N0:2 or position 295 of SEQ ID N0:7 is 
replaced with A; the N corresponding to position 3 1 5 of SEQ ID N0:2 or position 296 of 
SEQ ID N0:7 is replaced with A; the Y corresponding to position 316 of SEQ ID N0:2 
or position 297 of SEQ ID N0:7 is replaced with F or A; the Y corresponding to position 
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362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F; the K 
corresponding to position 367 of SEQ ID N0:2 or position 348 of SEQ ID N0:7 is 
replaced with A; the R corresponding to position 368 of SEQ ID N0:2 or position 349 of 
SEQ ID N0:7 is replaced with A; the F corresponding to position 369 of SEQ ID N0:2 
or position 350 of SEQ ID N0:7 is replaced with A; the K corresponding to position 371 
of SEQ ID N0:2 or position 352 of SEQ ID N0:7 is replaced with A; the L 
corresponding to position 372 of SEQ ID N0:2 or position 353 of SEQ ID N0:7 is 
replaced with A; and the L corresponding to position 373 of SEQ ID N0:2 or position 
354 of SEQ ID N0:7 is replaced with A; and a 3' transcription terminator. 

More preferably, the vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 modified by the 
following changes: the Y corresponding to position 106 of SEQ ID N0:2 or position 87 
of SEQ ID N0:7 is replaced with F; the Y corresponding to position 129 of SEQ ID 
N0:2 or position 110 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 or position 25 1 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 
or position 297 of SEQ ID N0:7 is replaced with F; and the Y corresponding to position 
362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Most preferably, the vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0;2 modified by the following changes or SEQ ID N0:7 modified by the 
following changes: the Y corresponding to position 185 of SEQ ID N0:2 or position 166 
of SEQ ID N0:7 is replaced with F; the Y corresponding to position 193 of SEQ ID 
N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 316 of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is 
replaced with F; and the Y corresponding to position 362 of SEQ ID N0:2 or position 
343 of SEQ ID N0:7 is replaced with F. 
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Recombinant host cells 



A further embodiment of the invention is directed towards recombinant host cells 
comprising a structural nucleic acid sequence encoding a deallergenized patatin protein. 
The recombinant host cell preferably produces a deallergenized patatin protein. More 
preferably, the recombinant host cell produces a deallergenized patatin protein in a 
concentration sufficient to inhibit growth or to kill an insect which ingests the 
recombinant host cell. The recombinant host cell can generally comprise any structural 
nucleic acid sequence encoding a deallergenized patatin protein. 

The recombinant host cell can comprise a structural nucleic acid sequence 
encoding SEQ ID N0:2 modified in one or more of the following regions, or SEQ ID 
N0:7 modified in one or more of the following regions. The single or multiple amino 
acid modifications reduce the binding of the modified protein relative to the binding of 
the corresponding urmiodified protein. The regions for modification include amino acid 
positions 104-113 of SEQ IDN0:2 (85-94 of SEQ ID N0:7), 128-137 of SEQ ID N0:2 
(109-118 of SEQ ID N0:7), 184-197 of SEQ ID N0:2 (165-178 of SEQ ID N0:7), 264- 
277 of SEQ ID N0:2 (245-258 of SEQ ID N0:7), 316-325 of SEQ ID N0:2 (297-306 of 
SEQ ID N0:7), and 360-377 of SEQ ID N0:2 (341-358 of SEQ ID N0:7). The possible 
amino acid modifications include replacing an amino acid with A, E, F, P, or S. The 
modifications replace one or more amino acids in the identified regions, without 
increasing or decreasing the total number of amino acids in the protein. 

Alternatively, the recombinant host cell comprises a structural nucleic acid 
sequence encoding SEQ ID N0:2 modified by one or more of the following changes or 
encoding SEQ ID N0:7 modified by one or more of the following changes; the Y 
corresponding to position 106 of SEQ ID N0:2 or position 87 of SEQ ID N0:7 is 
replaced with F or A; the I corresponding to position 1 13 of SEQ ID N0:2 or position 94 
of SEQ ID N0:7 is replaced with A; the Y corresponding to position 129 of SEQ ID 
N0:2 or position 1 10 of SEQ ID N0:7 is replaced with F or A; the K corresponding to 
position 137 of SEQ ID N0:2 or position 1 18 of SEQ ID N0:7 is replaced with A; the S 
corresponding to position 184 of SEQ ID N0:2 or position 165 of SEQ ID N0:7 is 
replaced with A; the Y corresponding to position 185 of SEQ ID N0:2 or position 166 of 
SEQ ID N0:7 is replaced with F or A; the A corresponding to position 188 of SEQ ID 
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N0:2 or position 169 of SEQ ID N0:7 is replaced with S; the T corresponding to position 
192 of SEQ ID N0:2 or position 173 of SEQ ID N0:7 is replaced with A or P; the Y 
corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F or A; the K corresponding to position 268 of SEQ ID N0:2 or position 
249 of SEQ ID N0:7 is replaced with A or E; the T corresponding to position 269 of 
SEQ ID N0:2 or position 250 of SEQ ID NO: 7 is replaced with A; the Y corresponding 
to position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is replaced with F or 
A; the K corresponding to position 273 of SEQ ID N0:2 or position 254 of SEQ ID N0:7 
is replaced with A; the K corresponding to position 313 of SEQ ID N0:2 or position 294 
of SEQ ID N0:7 is replaced with E; the N corresponding to position 314 of SEQ ID 
N0:2 or position 295 of SEQ ID N0:7 is replaced with A; the N corresponding to 
position 315 of SEQ ID N0:2 or position 296 of SEQ ID N0:7 is replaced with A; the Y 
corresponding to position 316 of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is 
replaced with F or A; the Y corresponding to position 362 of SEQ ID N0:2 or position 
343 of SEQ ID N0:7 is replaced with F; the K corresponding to position 367 of SEQ ID 
N0:2 or position 348 of SEQ ID N0:7 is replaced with A; the R corresponding to 
position 368 of SEQ ID N0:2 or position 349 of SEQ ID N0:7 is replaced with A; the F 
corresponding to position 369 of SEQ ID N0:2 or position 350 of SEQ ID N0:7 is 
replaced with A; the K corresponding to position 371 of SEQ ID N0:2 or position 352 of 
SEQ ID NO: 7 is replaced with A; the L correspondmg to position 372 of SEQ ID N0:2 
or position 353 of SEQ ID N0:7 is replaced with A; and the L corresponding to position 
373 of SEQ ID N0:2 or position 354 of SEQ ID N0:7 is replaced with A. 

More preferably, the recombinant host cell comprises a structural nucleic acid 
sequence encoding SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 
modified by the following changes: the Y corresponding to position 106 of SEQ ID N0:2 
or position 87 of SEQ ID N0:7 is replaced with F; the Y corresponding to position 129 of 
SEQ ID N0:2 or position 110 of SEQ ID N0:7 is replaced with F; the Y corresponding 
to position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F; the 
Y corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 or position 251 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 
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or position 297 of SEQ ID NO: 7 is replaced with F; and the Y corresponding to position 
362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Most preferably, the recombinant host cell comprises a structural nucleic acid 
sequence encoding SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 
modified by the following changes: the Y corresponding to position 1 85 of SEQ ID N0:2 
or position 166 of SEQ ID N0:7 is replaced with F; the Y corresponding to position 193 
of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 or position 297 of 
SEQ ID N0:7 is replaced with F; and the Y corresponding to position 362 of SEQ ID 
N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

The recombinant host cell can generally be any type of host cell, and preferably is 
a bacterial, fungal, or plant cell. The bacterial cell is preferably an Escherichia coli 
bacterial cell. The fungal cell is preferably a Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, or Pichia pastor is fungal cell. The plant cell can be a 
monocot, dicot, or conifer plant cell. The plant cell is preferably an alfalfa, banana, 
canola, com, cotton, cucimiber, peanut, potato, rice, soybean, sunflower, sweet potato, 
tobacco, tomato, or wheat plant cell. The recombinant host cell preferably further 
comprises operatively linked to the structural nucleic acid sequence a promoter that 
directs transcription of the structural nucleic acid sequence. The recombinant host cell 
preferably further comprises operatively linked to the structural nucleic acid sequence a 
3' transcription terminator and a polyadenylation site. 

Recombinant plants 

An additional embodiment of the invention is a recombinant plant comprising a 
structural nucleic acid sequence encoding a deallergenized patatin protein. The 
recombinant plant preferably produces a deallergenized patatin protein. More preferably, 
the recombinant plant produces a deallergenized patatin protein in a concentration 
sufficient to inhibit grov^lh or to kill an insect which ingests plant tissue from the 
recombinant plant. 
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The recombinant plant can comprise a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified in one or more of the following regions, or SEQ ID N0:7 
modified in one or more of the following regions. The single or multiple amino acid 
modifications reduce the binding of the modified protein relative to the binding of the 
corresponding unmodified protein. The regions for modification include amino acid 
positions 104-113 of SEQ ID N0:2 (85-94 of SEQ ID N0:7), 128-137 of SEQ ID N0:2 
(109-118 of SEQ ID N0:7), 184-197 of SEQ ID N0:2 (165-178 of SEQ ID N0:7), 264- 
277 of SEQ ID N0:2 (245-258 of SEQ ID N0:7), 316-325 of SEQ ID N0:2 (297-306 of 
SEQ ID N0:7), and 360-377 of SEQ ID N0:2 (341-358 of SEQ ID N0:7). The possible 
amino acid modifications include replacing an amino acid with A, E, F, P, or S. The 
modifications replace one or more amino acids in the identified regions, without 
increasing or decreasing the total number of amino acids in the protein. 

Alternatively, the recombinant plant can comprise a structural nucleic acid 
sequence encoding SEQ ID N0:2 modified by one or more of the following changes or 
encoding SEQ ID NO: 7 modified by one or more of the following changes: the Y 
corresponding to position 106 of SEQ ID N0:2 or position 87 of SEQ ID N0:7 is 
replaced with F or A; the I corresponding to position 113 of SEQ ID N0:2 or position 94 
of SEQ ID N0:7 is replaced with A; the Y corresponding to position 129 of SEQ ID 
N0:2 or position 1 10 of SEQ ID N0:7 is replaced with F or A; the K corresponding to 
position 1 37 of SEQ ID N0:2 or position 1 1 8 of SEQ ID N0:7 is replaced with A; the S 
corresponding to position 184 of SEQ ID N0:2 or position 165 of SEQ ID N0:7 is 
replaced with A; the Y corresponding to position 185 of SEQ ID N0:2 or position 166 of 
SEQ ID N0:7 is replaced with F or A; the A corresponding to position 188 of SEQ ID 
N0:2 or position 169 of SEQ ID N0:7 is replaced with S; the T corresponding to position 
192 of SEQ ID N0:2 or position 173 of SEQ ID N0:7 is replaced with A or P; the Y 
corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F or A; the K corresponding to position 268 of SEQ ID N0:2 or position 
249 of SEQ ID N0:7 is replaced with A or E; the T corresponding to position 269 of 
SEQ ID N0:2 or position 250 of SEQ ID N0:7 is replaced with A; the Y corresponding 
to position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is replaced with F or 
A; the K corresponding to position 273 of SEQ ID N0:2 or position 254 of SEQ ID N0:7 
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is replaced with A; the K corresponding to position 313 of SEQ ID N0:2 or position 294 
of SEQ ID N0:7 is replaced with E; the N corresponding to position 314 of SEQ ID 
N0:2 or position 295 of SEQ ID N0:7 is replaced with A; the N corresponding to 
position 3 15 of SEQ ID N0:2 or position 296 of SEQ ID NO: 7 is replaced with A; the Y 
corresponding to position 316 of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is 
replaced with F or A; the Y corresponding to position 362 of SEQ ID N0:2 or position 
343 of SEQ ID N0:7 is replaced with F; the K corresponding to position 367 of SEQ ID 
N0:2 or position 348 of SEQ ID N0:7 is replaced with A; the R corresponding to 
position 368 of SEQ ID N0:2 or position 349 of SEQ ID N0:7 is replaced with A; the F 
corresponding to position 369 of SEQ ID N0:2 or position 350 of SEQ ID N0:7 is 
replaced with A; the K corresponding to position 371 of SEQ ID N0:2 or position 352 of 
SEQ ID N0:7 is replaced with A; the L corresponding to position 372 of SEQ ID N0:2 
or position 353 of SEQ ID NO: 7 is replaced with A; and the L corresponding to position 
373 of SEQ ID N0:2 or position 354 of SEQ ID N0:7 is replaced with A. 

More preferably, the recombinant plant comprises a structural nucleic acid 
sequence encoding SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 
modified by the following changes: the Y corresponding to position 106 of SEQ ID N0:2 
or position 87 of SEQ ID N0:7 is replaced with F; the Y corresponding to position 129 of 
SEQ ID N0:2 or position 1 10 of SEQ ID N0:7 is replaced with F; the Y corresponding 
to position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F; the 
Y corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 or position 25 1 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 
or position 297 of SEQ ID N0:7 is replaced with F; and the Y corresponding to position 
362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Most preferably, the recombinant plant comprises a structural nucleic acid 
sequence encoding SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 
modified by the following changes: the Y corresponding to position 185 of SEQ ID N0:2 
or position 166 of SEQ ID N0:7 is replaced with F; the Y corresponding to position 193 
of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is 
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replaced with F; the Y corresponding to position 3 16 of SEQ ID N0:2 or position 297 of 
SEQ ID N0:7 is replaced with F; and the Y corresponding to position 362 of SEQ ID 
N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

The recombinant plant can generally be any type of plant. The plant can be a 
monocot, dicot, or conifer plant. The plant is preferably an alfalfa, banana, canola, com, 
cotton, cucumber, peanut, potato, rice, soybean, sunflower, sweet potato, tobacco, 
tomato, or wheat plant. 

The recombinant plant preferably further comprises operatively linked to the 
structural nucleic acid sequence a promoter that directs transcription of the structural 
nucleic acid sequence. The recombinant plant preferably further comprises operatively 
linked to the structural nucleic acid sequence a 3' transcription terminator and a 
polyadenylation site. 

Methods of preparation 

Embodiments of the invention are ftirther directed towards methods of preparing 
recombinant host cells and recombinant plants useful for the production of deallergenized 
patatin proteins. 

A method of preparing a recombinant host cell useful for the production of 
deallergenized patatin proteins can comprise selecting a host cell; transforming the host 
cell with a recombinant vector; and obtaining recombinant host cells. 

The recombinant vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified in one or more of the following regions, or SEQ ID N0:7 
modified in one or more of the following regions. The single or multiple amino acid 
modifications reduce the binding of the modified protein relative to the binding of the 
corresponding unmodified protein. The regions for modification include amino acid 
positions 104-113 of SEQ ID N0:2 (85-94 of SEQ ID N0:7), 128-137 of SEQ ID N0:2 
(109-118 of SEQ ID N0:7), 184-197 of SEQ ID N0:2 (165-178 of SEQ ID N0:7), 264- 
277 of SEQ ID N0:2 (245-258 of SEQ ID N0:7), 316-325 of SEQ ID N0:2 (297-306 of 
SEQ ID N0:7), and 360-377 of SEQ ID N0:2 (341-358 of SEQ ID N0:7). The possible 
amino acid modifications include replacing an amino acid with A, E, F, P, or S. The 
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modifications replace one or more amino acids m the identified regions, without 
increasing or decreasing the total number of amino acids in the protein. 

Alternatively, the recombinant vector comprises a structural nucleic acid sequence 
encoding SEQ ID N0:2 modified by one or more of the following changes or encoding 
SEQ ID N0:7 modified by one or more of the following changes: the Y corresponding to 
position 106 of SEQ ID N0:2 or position 87 of SEQ ID N0:7 is replaced with F or A; the 
I corresponding to position 113 of SEQ ID N0:2 or position 94 of SEQ ID N0:7 is 
replaced with A; the Y corresponding to position 129 of SEQ ID N0:2 or position 110 of 
SEQ ID N0:7 is replaced with F or A; the K corresponding to position 137 of SEQ ID 
NO-2 or position 118 of SEQ ID N0:7 is replaced with A; the S corresponding to 
position 184 of SEQ ID N0:2 or position 165 of SEQ ID N0:7 is replaced with A; the Y 
corresponding to position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is 
replaced with F or A; the A corresponding to position 188 of SEQ ID N0:2 or position 
169 of SEQ ID NO:7 is replaced with S; the T corresponding to position 192 of SEQ ID 
NO-2 or position 173 of SEQ ID N0:7 is replaced with A or P; the Y coiresponding to 
position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is replaced with F or A; 
the K corresponding to position 268 of SEQ ID N0:2 or position 249 of SEQ ID N0:7 is 
replaced with A or E; the T corresponding to position 269 of SEQ ID N0:2 or position 
250 of SEQ ID N0:7 is replaced with A; the Y corresponding to position 270 of SEQ ID 
N0:2 or position 251 of SEQ ID N0:7 is replaced with F or A; the K corresponding to 
position 273 of SEQ ID N0:2 or position 254 of SEQ ID N0:7 is replaced with A; the K 
corresponding to position 313 of SEQ ID NO:2 or position 294 of SEQ ID N0:7 is 
replaced with E; the N corresponding to position 314 of SEQ ID N0:2 or position 295 of 
SEQ ID N0:7 is replaced with A; the N corresponding to position 315 of SEQ ID N0:2 
or position 296 of SEQ ID N0:7 is replaced with A; the Y corresponding to position 316 
of SEQ ID NO:2 or position 297 of SEQ ID N0:7 is replaced with F or A; the Y 
corresponding to position 362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is 
replaced with F; the K corresponding to position 367 of SEQ ID N0:2 or position 348 of 
SEQ ID N0:7 is replaced with A; the R corresponding to position 368 of SEQ ID N0:2 
or position 349 of SEQ ID N0:7 is replaced with A; the F corresponding to position 369 
of SEQ ID N0:2 or position 350 of SEQ ID N0:7 is replaced with A; the K 
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corresponding to position 371 of SEQ ID N0:2 or position 352 of SEQ ID N0:7 is 
replaced with A; the L corresponding to position 372 of SEQ ID N0:2 or position 353 of 
SEQ ID N0:7 is replaced with A; and the L corresponding to position 373 of SEQ ID 
N0:2 or position 354 of SEQ ID N0:7 is replaced with A. 

More preferably, the vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 modified by the 
following changes: the Y corresponding to position 106 of SEQ ID N0:2 or position 87 
of SEQ ID N0:7 is replaced with F; the Y corresponding to position 129 of SEQ ID 
N0:2 or position 110 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 or position 251 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 
or position 297 of SEQ ID N0:7 is replaced with F; and the Y corresponding to position 
362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Most preferably, the vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 modified by the 
following changes: the Y corresponding to position 185 of SEQ ID N0:2 or position 166 
of SEQ ID N0:7 is replaced with F; the Y corresponding to position 193 of SEQ ID 
N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 270 of SEQ ID NO:2 or position 251 of SEQ ID NO:7 is replaced with F; the Y 
corresponding to position 316 of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is 
replaced with F; and the Y corresponding to position 362 of SEQ ID N0:2 or position 
343 of SEQ ID N0:7 is replaced with F. 

The method can generally be used to prepare any type of recombinant host cell. 
Preferably, the method can be used to prepare a recombinant bacterial cell, a recombinant 
fungal cell, or a recombinant plant cell. The bacterial cell is preferably an Escherichia 
coli bacterial cell. The fungal cell is preferably a Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, or Pichia pastoris fungal cell. The plant cell can be a 
monocot, dicot, or conifer plant cell. The plant cell is preferably an alfalfa, banana. 
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canola, com, cotton, cucumber, peanut, potato, rice, soybean, sunflower, sweet potato, 
tobacco, tomato, or wheat plant cell. 

An additional embodiment is directed towards methods for the preparation of 
recombinant plants useful for the production of deallergenized patatin proteins. The 
method can comprise selecting a host plant cell; transforming the host plant cell with a 
recombinant vector; obtaining recombinant host cells; and regenerating a recombinant 
plant from the recombinant host plant cells. 

The recombinant vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified in one or more of the following regions, or SEQ ID N0:7 
modified in one or more of the following regions. The single or multiple amino acid 
modifications reduce the binding of the modified protein relative to the binding of the 
corresponding unmodified protein. The regions for modification include amino acid 
positions 104-1 13 of SEQ ID N0:2 (85-94 of SEQ ID N0:7), 128-137 of SEQ ID N0:2 
(109-118 of SEQ ID N0:7), 184-197 of SEQ ID N0:2 (165-178 of SEQ ID N0:7), 264- 
277 of SEQ ID N0:2 (245-258 of SEQ ID N0:7), 316-325 of SEQ ID N0:2 (297-306 of 
SEQ ID N0:7), and 360-377 of SEQ ID N0:2 (341-358 of SEQ ID N0:7). The possible 
amino acid modifications include replacing an amino acid with A, E, F, P, or S. The 
modifications replace one or more amino acids in the identified regions, without 
increasing or decreasing the total number of amino acids in the protein. 

Alternatively, the recombinant vector comprises a structural nucleic acid sequence 
encoding SEQ ID N0:2 modified by one or more of the following changes or encoding 
SEQ ID N0:7 modified by one or more of the following changes: the Y corresponding to 
position 106 of SEQ ID N0:2 or position 87 of SEQ ID N0:7 is replaced with F or A; the 
I corresponding to position 113 of SEQ ID NO:2 or position 94 of SEQ ID N0:7 is 
replaced with A; the Y corresponding to position 129 of SEQ ID N0:2 or position 110 of 
SEQ ID NO: 7 is replaced with F or A; the K corresponding to position 137 of SEQ ID 
N0:2 or position 118 of SEQ ID N0:7 is replaced with A; the S corresponding to 
position 1 84 of SEQ ID N0:2 or position 165 of SEQ ID N0:7 is replaced with A; the Y 
corresponding to position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is 
replaced with F or A; the A corresponding to position 188 of SEQ ID N0:2 or position 
169 of SEQ ID N0:7 is replaced with S; the T corresponding to position 192 of SEQ ID 
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N0:2 or position 173 of SEQ ID N0:7 is replaced with A or P; the Y corresponding to 
position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is replaced with F or A; 
the K corresponding to position 268 of SEQ ID N0:2 or position 249 of SEQ ID N0:7 is 
replaced with A or E; the T corresponding to position 269 of SEQ ID N0:2 or position 
250 of SEQ ID N0:7 is replaced with A; the Y corresponding to position 270 of SEQ ID 
N0:2 or position 251 of SEQ ID N0:7 is replaced with F or A; the K corresponding to 
position 273 of SEQ ID N0:2 or position 254 of SEQ ID N0:7 is replaced with A; the K 
corresponding to position 313 of SEQ ID N0:2 or position 294 of SEQ ID N0:7 is 
replaced with E; the N corresponding to position 314 of SEQ ID N0:2 or position 295 of 
SEQ ID N0:7 is replaced with A; the N corresponding to position 3 1 5 of SEQ ID N0:2 
or position 296 of SEQ ID N0:7 is replaced with A; the Y corresponding to position 316 
of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is replaced with F or A; the Y 
corresponding to position 362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is 
replaced with F; the K corresponding to position 367 of SEQ ID N0:2 or position 348 of 
SEQ ID N0:7 is replaced with A; the R corresponding to position 368 of SEQ ID N0:2 
or position 349 of SEQ ID NO: 7 is replaced with A; the F corresponding to position 369 
of SEQ ID N0:2 or position 350 of SEQ ID N0:7 is replaced with A; the K 
corresponding to position 371 of SEQ ID N0:2 or position 352 of SEQ ID N0:7 is 
replaced with A; the L corresponding to position 372 of SEQ ID N0:2 or position 353 of 
SEQ ID N0:7 is replaced with A; and the L corresponding to position 373 of SEQ ID 
N0:2 or position 354 of SEQ ID N0:7 is replaced with A. 

More preferably, the vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 modified by the 
following changes: the Y corresponding to position 106 of SEQ ID N0:2 or position 87 
of SEQ ID N0:7 is replaced with F; the Y corresponding to position 129 of SEQ ID 
N0:2 or position 110 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 185 of SEQ ID N0:2 or position 166 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 193 of SEQ ID N0:2 or position 174 of SEQ ID N0:7 is 
replaced with F; the Y corresponding to position 270 of SEQ ID N0:2 or position 251 of 
SEQ ID N0:7 is replaced with F; the Y corresponding to position 316 of SEQ ID N0:2 
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or position 297 of SEQ ID N0:7 is replaced with F; and the Y corresponding to position 
362 of SEQ ID N0:2 or position 343 of SEQ ID N0:7 is replaced with F. 

Most preferably, the vector comprises a structural nucleic acid sequence encoding 
SEQ ID N0:2 modified by the following changes or SEQ ID N0:7 modified by the 
following changes: the Y corresponding to position 185 of SEQ ID N0:2 or position 166 
of SEQ ID N0:7 is replaced with F; the Y corresponding to position 193 of SEQ ID 
N0:2 or position 174 of SEQ ID N0:7 is replaced with F; the Y corresponding to 
position 270 of SEQ ID N0:2 or position 251 of SEQ ID N0:7 is replaced with F; the Y 
corresponding to position 316 of SEQ ID N0:2 or position 297 of SEQ ID N0:7 is 
replaced with F; and the Y corresponding to position 362 of SEQ ID N0:2 or position 
343 of SEQ ID N0:7 is replaced with F. 

The recombinant plant can generally be any type of plant. The plant can be a 
monocot, dicot, or conifer plant. The plant is preferably an alfalfa, banana, canola, com, 
cotton, cucumber, peanut, potato, rice, soybean, sunflower, sweet potato, tobacco, 
tomato, or wheat plant. 

Deallergenized patatin proteins can be prepared by isolating the deallergenized 
patatin protein from any one of the above described host cells or plants. 

Deglvcosvlation 

The examples herein provide evidence that glycosylation of can contribute to the 
allergenicity of a protein. Accordingly, rational substitution of amino acid residues likely 
to be the targets of glycosylation within a subject allergen protein may reduce or 
eliminate the allergenic properties of the protein without adversely affecting the 
enzymatic, insecticidal, antifungal or other functional properties of the protein. 

Glycosylation commonly occurs as either N-linked or 0-linked forms. N-linked 
glycosylation usually occurs at the motif Asn-Xaa-Ser/Thr, where Xaa is any amino acid 
except Pro (Kasturi, L. et ai., Biochem J. 323: 415-519, 1997; Melquist, J.L. et al.. 
Biochemistry 37: 6833-6837, 1998). 0-linked glycosylation occurs between the hydroxyl 
group of serine or threonine and an amino sugar. 

Site directed mutagenesis of selected asparagine, serine, or threonine may be used 
to reduce or eliminate the glycosylation of patatin proteins. A search of SEQ ID N0:2 
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for the Asn-Xaa-Ser/Thr motif reveals one occurrence at amino acid positions 202-204. 
Mutagenization of the nucleic acid sequence encoding this region may result in a reduced 
allergenicity of the encoded protein. 

In order to test this conceptual approach to reducing allergenicity of patatin 
proteins, two sets of experiments were performed: a) production of patatin proteins in 
Escherichia coli, which do not glycosylate proteins; and b) production of patatin proteins 
with an N202Q site directed mutation. 

Antibodies obtained from patients HS-07 and G15-M0N (not potato allergic) did 
not show specific binding to wild type patatin, patatin produced in E. coli, or the N202Q 
variant. Antibodies obtained from patient HS-01 (potato allergic) bound to wild type 
patatin, but not to patatin produced in E. coli or the N202Q variant. Antibodies obtained 
from patient HS-02 (potato allergic) bound strongly to wild type patatin, but extremely 
weakly to patatin produced in E. coli, and binding to the N202Q variant resembled vector 
controls. Antibodies obtained from patient HS-03 (potato allergic) bound to wild type 
patatin, but not to patatin produced in E. coli or the N202Q variant. Antibodies obtained 
from patient HS-05 (potato allergic) bound to wild type patatin, but very weakly to 
patatin produced in E. coli and the N202Q variant. Antibodies obtained from patient HS- 
06 (potato allergic) strongly bound wild type patatin, the N202Q variant, and to patatin 
produced in E. coli. These results strongly suggest that glycosylation is at least partially 
responsible for the antigenic properties of patatin proteins, and that site directed 
mutagenesis may be used to reduce or eliminate specific antibody binding. Mutagenesis 
at position 202 of SEQ ID N0:2 may be useful for reducing or eliminating specific 
antibody binding. 

Permuteins 

The positions of the internal breakpoints described in the following Examples are 
found on the protein surface, and are distributed throughout the linear sequence without 
any obvious bias towards the ends or the middle. Breakpoints occurring below the 
protein surface can additionally be selected. The rearranged two subunits can be joined 
by a peptide linker. A preferred embodiment involves the linking of the N-terminal and 
C-terminal subunits by a three amino acid linker, although linkers of various sizes can be 
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1 used. Additionally, the N-terminal and C-terminal subunits can be joined lacking a linker 

2 sequence. Furthermore, a portion of the C-terminal subunit can be deleted and the 

3 connection made from the truncated C-terminal subunit to the original N-terminal subunit 

4 and vice versa as previously described (Yang and Schachman, Proc. Natl. Acad. Sci. 

5 U.S.A.. 90: 11980-11984, 1993; Viguera, et ai, Mol. Biol, 247: 670-681, 1995; 

6 Protasova, et al, Prot. Eng., 7: 1373-1377, 1994). 

7 The novel insecticidal proteins of the present invention can be represented by the 

8 formula: 

9 X'-(L)a-X^ 

10 wherein; 

11 a is 0 or 1, and if a is 0, then the permutein does not contain a linker 

12 sequence; 

13 X' is a polypeptide sequence corresponding to amino acids n+1 through J; 

14 X'^ is a polypeptide corresponding to amino acids 1 through n; 

15 n is an integer ranging from 1 to J-1 ; 

16 J is an integer greater than n+1 ; and 

17 L is a linker. 

18 In the formula above, the constituent amino acid residues of the novel insecticidal 

19 protein are numbered sequentially 1 through J from the original amino terminus to the 

20 original carboxyl terminus. A pair of adjacent amino acids within this protein can be 

21 numbered n and n+1 respectively where n is an integer ranging from 1 to J-1. The 

22 residue n+1 becomes the new N-terminus of the novel insecticidal protein and the residue 

23 n becomes the new C-terminus of the novel insecticidal protein. 

24 For example, a parent protein sequence consisting of 120 amino acids can be 

25 selected as a starting point for designing a permutein (J=120). If the breakpoint is 

26 selected as being between position 40 and position 41, then n=40. If a linker is selected 

27 to join the two subunits, the resulting permutein will have the formula: (amino acids 41- 

28 120)-L-(amino acids 1-40). If a linker was not used, the resulting permutein will have the 

29 formula: (amino acids 41-120)-(amino acids 1-40). 

30 The length of the amino acid sequence of the linker can be selected empirically, 

31 by using structural information, or by using a combination of the two approaches. When 
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no structural information is available, a small series of linJcers can be made whose length 
can span a range of 0 to 50 A and whose sequence is chosen in order to be substantially 
consistent with surface exposure (Hopp and Woods, Mol. Immunol, 20: 483-489, 1983; 
Kyte and Doolittle, J. Mol. Biol. 157: 105-132, 1982; Lee and Richards, J. Mol Biol, 
55: 379-400, 1971) and the ability to adopt a conformation which does not significantly 
affect the overall configuration of the protein (Karplus and Schulz, Naturwissenschaften, 
72: 212-213, 1985). Assuming an average length of 2.0 to 3.8 A per residue, this would 
mean the length to test would be between about 0 to about 30 residues, with 0 to about 1 5 
residues being the preferred range. Accordingly, there are many such sequences that vary 
in length or composition that can serve as linkers with the primary consideration being 
that they be neither excessively long nor excessively short (Sandhu, et al. Critical Rev. 
Biotech, 12: Al)lAei, 1992). If the linker is too long, entropy effects may destabilize the 
three-dimensional fold and may affect protein folding. If the linker is too short, it may 
destabilize the molecule due to torsional or steric strain. 

Use of the distance between the chain ends, defined as the distance between the 
C-alpha carbons, can be used to define the length of the sequence to be used, or at least to 
limit the number of possibilities that can be tested in an empirical selection of linkers. 
Using the calculated length as a guide, linkers with a range of number of residues 
(calculated using 2 to 3.8 A per residue) can be selected. These linkers can be composed 
of the original sequence, shortened or lengthened as necessary, and when lengthened the 
additional residues can be chosen to be flexible and hydrophilic as described above; or 
optionally the original sequence can be substituted for using a series of linkers, one 
example being Gly-Pro-Gly (SEQ ID NO:277); or optionally a combination of the 
original sequence and new sequence having the appropriate total length can be used. An 
alternative short, flexible linker sequence is Gly-Gly-Gly-Ser-Gly-Gly-Gly (SEQ ID 
NO:276). 

Selection of permutein breakpoints 

Sequences of novel patatin analogs capable of folding to biologically active 
molecules can be prepared by appropriate selection of the beginning (amino terminus) 
and ending (carboxyl terminus) positions from within the original polypeptide chain 
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while optionally using a linker sequence as described above. Amino and carboxyl 
termini can be selected from within a common stretch of sequence, referred to as a 
breakpoint region, using the guidelines described below. A novel amino acid sequence is 
thus generated by selecting amino and carboxyl termini from within the same breakpoint 
region. In many cases, the selection of the new termini will be such that the original 
position of the carboxyl terminus immediately preceded that of the amino terminus. 
However, selections of termini anywhere within the region may result in a functional 
protein, and that these will effectively lead to either deletions or additions to the amino or 
carboxyl portions of the new sequence. 

The primary amino acid sequence of a protein dictates folding to the three- 
dimensional structure beneficial for expression of its biological function. It is possible to 
obtain and interpret three-dimensional structural information using x-ray diffraction of 
single protein crystals or nuclear magnetic resonance spectroscopy of protein solutions. 
Examples of structural information that are relevant to the identification of breakpoint 
regions include the location and type of protein secondary structure (alpha and 3-10 
helices, parallel and anti-parallel beta sheets, chain reversals and turns, and loops 
(Kabsch and Sander, Biopolymers, 22: 2577-2637, 1983), the degree of solvent exposure 
of amino acid residues, the extent and type of interactions of residues with one another 
(Chothia, C, Ann. Rev. Biochem., 53: 537-572, 1984), and the static and dynamic 
distribution of conformations along the polypeptide chain (Alber and Mathews, Methods 
Enzymol, 154: 511-533, 1987). In some cases additional information is known about 
solvent exposure of residues, one example is a site of post-translational attachment of 
carbohydrate which is necessarily on the surface of the protein. When experimental 
structural information is not available, or when it is not feasible to obtain the information, 
methods are available to analyze the primary amino acid sequence in order to make 
predictions of protein secondary and tertiary structure, solvent accessibility and the 
occurrence of turns and loops (Fasman, G., Ed. Plenum, New York, 1989; Robson, B. 
and Gamier, J. Nature 361 : 506, 1993). 

Biochemical methods can be applicable for empirically determining surface 
exposure when direct structural methods are not feasible; for example, using the 
identification of sites of chain scission following limited proteolysis in order to infer 
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surface exposure (Gentile, F. and Salvatore, G., Eur. J. Biochem.. 218: 603-621, 1993). 
Thus, using either the experimentally derived structural information or predictive 
methods (Srinivasan, R. and Rose, G.D. Proteins, 22: 81-99, 1995), the parental amino 
acid sequence can be analyzed to classify regions according to whether or not they are 
integral to the maintenance of secondary and tertiary structure. The sequences within 
regions that are known to be involved in periodic secondary structure (alpha and 3-10 
helices, parallel and anti-parallel beta sheets) are regions that should be avoided. 
Similarly, regions of amino acid sequence that are observed or predicted to have a low 
degree of solvent exposure are more likely to be part of the so-called hydrophobic core of 
the protein and should also be avoided for selection of amino and carboxyl termini. 
Regions that are known or predicted to be in surface turns or loops, and especially those 
regions that are known not to be required for biological activity, can be preferred sites for 
new amino and carboxyl termini. Stretches of amino acid sequence that are preferred 
based on the above criteria can be selected as breakpoint regions. 

An embodiment of the invention is directed towards patatin permutein proteins. 
The permutein proteins preferably maintain esterase activity and insecticidal properties. 
The permutein proteins preferably are less allergenic than the wild type patatin protein to 
individuals or animals allergic to potatoes. This can be assayed by the binding of 
antibodies to the wild type patatin and patatin permutein proteins. 

The permutein proteins can optionally contain a linker sequence. The linker can 
generally be any amino acid sequence, preferably is Gly-Gly-Gly-Ser-Gly-Gly-Gly (SEQ 
ID NO:276) or Gly-Pro-Gly (SEQ ID NO:277), and more preferably is Gly-Pro-Gly 
(SEQ ID NO:277). Specific permutein proteins comprise: (amino acids 247-386 of SEQ 
ID N0:2)-linker-( amino acids 24-246 of SEQ ID N0:2), (amino acids 269-386 of SEQ 
ID N0:2)-linker-( amino acids 24-268 of SEQ ID N0:2), SEQ ID NO:247, and SEQ ID 
NO:259. 

Embodiments of the invention also include isolated nucleic acid molecule 
segments comprising a structural nucleic acid sequence encoding a patatin permutein 
protein. The encoded permutein protein can generally be any permutein protein, and 
preferably comprises (amino acids 247-386 of SEQ ID NO:2)-linker-(amino acids 24-246 
of SEQ ID N0:2), (amino acids 269-386 of SEQ ID NO:2)-linker-(amino acids 24-268 of 
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SEQ ID N0:2), SEQ ID NO:247, or SEQ ID NO:259. The linker can generally be any 
amino acid sequence, preferably is Gly-Gly-Gly-Ser-Gly-Gly-Gly (SEQ ID NO:276) or 
Gly-Pro-Gly (SEQ ID NO:277), and more preferably is Gly-Pro-Gly (SEQ ID NO:277). 
Alternatively, the encoded patatin permutein protein can lack a linker sequence. The 
structural nucleic acid sequence is preferably SEQ ID NO:246 or SEQ ID NO:258. 

An embodiment of the invention is directed towards recombinant vectors which 
encode a patatin permutein protein. The vector can comprise operatively linked in the 5' 
to 3' orientation: a promoter that directs transcription of a structural nucleic acid 
sequence; a structural nucleic acid sequence encoding a protein selected from the group 
consisting of: (amino acids 247-386 of SEQ ID NO:2)-linker-(amino acids 24-246 of 
SEQ ID N0:2); and (amino acids 269-386 of SEQ ID NO:2)-linker-(amino acids 24-268 
of SEQ ID N0:2); and a 3' transcription terminator. The linker can comprise Gly-Pro- 
Gly (SEQ ID NO:277) or Gly-Gly-Gly-Ser-Gly-Gly-Gly (SEQ ID NO:276). 
Alternatively, the encoded patatin permutein protein can lack a linker sequence. The 
structural nucleic acid sequence can preferably be SEQ ID NO:246 or SEQ ID NO:258, 
and preferably encodes SEQ ID NO:247 or SEQ ID NO:259. 

An additional embodiment of the invention is directed towards recombinant host 
cells useful for the production of a patatin permutein protein. The recombinant host cell 
preferably produces a patatin permutein protein. More preferably, the recombinant host 
cell produces a patatin permutein protein in a concentration sufficient to inhibit growth or 
to kill an insect which ingests the recombinant host cell. The recombinant host cell can 
comprise a structural nucleic acid sequence encoding a protein selected from the group 
consisting of: (amino acids 247-386 of SEQ ID NO:2)-linker-(amino acids 24-246 of 
SEQ ID N0:2); and (amino acids 269-386 of SEQ ID NO:2)-linker-(amino acids 24-268 
of SEQ ID N0:2). The linker can generally be any amino acid sequence, and preferably 
is Gly-Pro-Gly (SEQ ID NO:277) or Gly-Gly-Gly-Ser-GIy-Gly-Gly (SEQ ID NO:276). 
Alternatively, the encoded patatin permutein protein can lack a linker sequence. The 
structural nucleic acid sequence is preferably SEQ ID NO:246 or SEQ ID NO:258, and 
preferably encodes SEQ ID NO:247 or SEQ ID NO:259. The structural nucleic acid 
sequence can be operatively linked to a promoter sequence that directs transcription of 
the structural nucleic acid sequence, a 3' transcription terminator, and a 3' 
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polyadenylation signal sequence. The recombinant host cell can generally be any type of 
host cell, and preferably is a bacterial, fungal, or plant host cell. The bacterial cell is 
preferably an Escherichia colt bacterial cell. The fungal cell is preferably a 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, or Pichia pastoris fungal cell. 
The plant cell can be a monocot, dicot, or conifer plant cell. The plant cell is preferably 
an alfalfa, banana, canola, com, cotton, cucumber, peanut, potato, rice, soybean, 
sunflower, sweet potato, tobacco, tomato, or wheat plant cell. 

An additional embodiment of the invention is directed towards recombinant plants 
which are useful for the production of palatin permutein proteins. The recombinant plant 
preferably produces a patatin permutein protein. More preferably, the recombinant plant 
produces a patatin permutein protein in a concentration sufficient to inhibit growth or to 
kill an insect which ingests tissue firom the recombinant plant. The recombinant plant can 
comprise a structural nucleic acid sequence encoding a protein selected from the group 
consisting of: (amino acids 247-386 of SEQ ID NO:2)-linker-(amino acids 24-246 of 
SEQ ID N0:2); and (amino acids 269-386 of SEQ ID NO:2)-linker-(amino acids 24-268 
of SEQ ID N0:2). The linker can comprise Gly-Pro-Gly (SEQ ID NO:277) or Gly-Gly- 
Gly-Ser-Gly-Gly-Gly (SEQ ID NO:276). Alternatively, the encoded protein can lack a 
linker sequence. The structural nucleic acid sequence is preferably SEQ ID NO:246 or 
SEQ ID NO:258, and preferably encodes SEQ ID NO:247 or SEQ ID NO:259. The 
structural nucleic acid sequence can be operatively linked to a promoter sequence that 
directs transcription of the structural nucleic acid sequence, a 3' transcription terminator, 
and a 3' polyadenylation signal sequence. The recombinant plant can generally be any 
type of plant, and preferably is an alfalfa, banana, canola, com, cotton, cucumber, peanut, 
potato, rice, soybean, sunflower, sweet potato, tobacco, tomato, or wheat plant. 

Permutein proteins can be prepared by isolating the permutein protein fi^om any 
one of the above described host cells or plants. 

Immunotherapy for potato allergy 

Immunotherapy for food allergy has been largely unsuccessful due to the lack of 
appropriate therapeutic reagents (Sampson, H.A., J. Allergy Clin. Immunol, 90(2): 151- 
152, 1992). Immunotherapy has typically involved the administration (orally or by 
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subcutaneous injections) of increasing doses of crude protein extracts of the offending 
allergenic entities which usually contain variable mixes of many different proteins 
(Scheiner, O., Wien Klin Wochenschr., 105(22): 653-658, 1993). While there are reports 
of highly successful clinical applications of immunotherapy for food allergens (Romano, 
P.C., etal.,Allergol. Immurtopathol. (Madr), 12(4): 275-281, 1984), those reports are rare 
and the clinical literature in general recommends avoidance far more strongly than 
therapy (Gay, G., Allerg. Immunol. (Paris), 29(6): 169-170, 1997). One of the primary 
reasons for the failure of many clinical attempts to induce tolerance to allergens in 
general and food allergens in particular relates to anecdotal comments by numerous 
allergists, that patients don't tolerate the doses of allergen required to achieve tolerance. 
Animal studies examining the relationship of antigen dose and the induction of tolerance 
have demonstrated a strong positive correlation (Chen, Y., et al., Proc. Natl. Acad. Sci., 
U.S.A., 93: 388-391, 1996; Tokai, T., etal., Nat. Biotechnol., 15(8): 754-758, 1997). Due 
to the very real possibility of inducing an anaphylactic reaction in patients with native 
allergen, most clinical therapists are quite hesitant to use high doses therapeutically and 
are therefore compromising the likelihood of successful therapy. 

In recent reports, recombinant technology has been used to reduce the allergenic 
potential of a major allergen without modifying the T cell epitopes, and allowing higher 
doses of protein to be used in therapy (Tokai, T., et al., Nat. Biotechnol., 15(8): 754-758, 
1997). In addition, a lack of understanding about the appropriate route of administration, 
the uncertainty of mechanisms responsible for induction of allergy and the uncertainty of 
mechanisms by which inmiunotherapy suppresses or blocks the T cell-IgE- 
eosinophil/mast cell cycle have contributed to the large number of equivocal studies and 
clinical trials. Recent studies in animal models dealing with mechanisms, routes of 
administration, adjuvants and vaccine formulations have increased the likelihood that 
immunotherapy for allergies, including food allergies, will become a reproducibly 
successful clinical treatment when the appropriate therapeutic reagents are available 
(Sampson, H.A. and Burks, A.W., Annu. Rev. Nutr., 16: 161-177, 1996; Kaminogawa, S., 
Biosci. Biotechnol. Biochem., 60(11): 1749-1756, 1996; Chapman, M.D., et al.. Allergy, 
52: 374-379, 1997; Barbeau, W.E.,Adv. Exp. Med Biol., 415: 183-193, 1997; Cao, Y., et 
al., Immunology, 90(1): 46-51, 1997; Garside, P. and Mowat, A.M., Crit. Rev. Immunol., 
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17(2): 1 19-137, 1997; Rothe, M.J. and Grant-Kels, J.M., J. Am. Acad. Dermatol, 35(1): 
1-13, 1996; Strobel, S., Allergy, 50(20): 18-25, 1995; Kniisbeek, A.M. and Amsen, D., 
Curr. Opin. Immunol, 8(2): 233-244, 1996; Herz, U., et al., Adv. Exp. Med Biol, 409: 
25-32, 1996; Litwin, A., et al, J. Allergy Clin. Immunol, 100: 30-38, 1997; 
Vandewalker, M.L., Mo. Med, 94(7): 311, 1997; Marshall, G.D., Jr. and Davis, F., Nat. 
Biotechnol, 15(8): 718-719, 1997; Van Deusen, M.A., et al, Ann. Allergy Asthma 
Immunol, 78: 573-580, 1997; Jacobsen, L., et al. Allergy, 52: 914-920, 1997, Scheiner, 
O. and Kraft, D., Allergy 50(5): 384-391, 1995). 

Relative to immunotherapy, the critical aspects of the modified patatin genes 
described in this patent are that they can be used to synthesize purified, deallergenized- 
protein which can be used for patatin (potato) specific immunotherapy, with reduced 
potential for adverse and potentially fatal anaphylactic reactions in human or veterinary 
patients who have allergies to patatin or potatoes. Various strategies, including fixing or 
cross linking allergens, encapsulation of allergen for oral delivery, the use of small, T-cell 
epitope peptides and most recently, the use of engineered recombinant proteins, or 
modified gene vaccines are being tested in attempts to decrease the potential for 
anaphylactic reactions while inducing tolerance (Cao, Y., et al. Immunology, 90(1): 46- 
51, 1997; Chapman, M.D., et al. Allergy, 52: 374-379, 1997; Chapman, M.D., et al. Int. 
Arch. Allergy Immunol, 113(1-3): 102-104, 1997; Collins, S.P., et al, Clin. Exp Allergy, 
26(1): 36-42, 1996; Takai, T., et al, Mol Immunol, 34(3): 255-261, 1997; Takai, T., et 
al, Nat. Biotechnol, 15(8) 754-758, 1997; Jirapongsananruk, O. and Leung, D.Y.M., 
Ann. Allergy Asthma Immunol, 79: 5-20, 1997; Litwin, A., et al, J. Allergy Clin. 
Immunol, 100: 30-38, 1997; Vandewalker, M.L., Mo. Med, 94(7): 311, 1997; Raz, E., et 
al, Proc. Natl Acad Sol, U.S.A., 93: 5141-5145, 1996; Hoyne, G.F., et al, Clin. 
Immunol Immunopathol , 80: S23-30, 1996; Hoyne, G.F., et al. Int. Immunol, 9(8): 
1165-1173, 1997; Vrtala, S., et al, J. Clin. Invest., 99(7): 1673-1681, 1997; Sato, Y., et 
al. Science, 273: 352-354, 1996; Lee, D.J., et al. Int. Arch. Allergy Immunol, 113(1-3): 
227-230, 1997; Tsitoura, D.C., et al,J. Immunol, 157(5): 2160-2165, 1996; Hsu, C.H., 
et al. Int. Immunol, 8(9): 1405-1411, 1996; Hsu, C.H., et al, Nat. Med., 2(5): 540-544, 
1996). 



I194!S(82MF01! DCX) 



-46- of -213- 



The instant invention uses an engineered patatin protein, as expressed in any 
living cell, with or without post-synthesis modifications, for immunotherapy by the 
routes of cutaneous or subcutaneous exposure, injection, or by oral, gastro-intestinal, 
respiratory or nasal application, either with, or without the use of specific carriers, 
vehicles and adjuvants. The direct application of nucleic acid encoding recombinant 
patatin as the in vivo (in the patient) expression template (gene) as RNA-, DNA- or gene- 
vaccines is also the intended use of the engineered genetic materials defined here, coding 
for patatin, but with modified IgE binding sites. It is also the intent of this patent to cover 
the use of these modified genes described here including insertion into various DNA 
vectors including adenovirus, retrovirus, pox virus and replicating or non-replicating 
eukaryotic expression plasmids (Lee, D.J., et al. Int. Arch. Allergy Immunol, 1 13(1-3): 
227-230, 1997) with various promoters and regulatory sequences, which can be inserted 
into the patient's somatic cells (dendritic cells, epithelial cells, muscle fiber-cells, 
fibroblasts, etc.) for the purpose of expressing the recombinant gene product to alter the 
patient's immune response to the patatin proteins (Lee D.J., et al, Int. Arch. Allergy 
Immunol, 113(1-3): 227-230, 1997). Potential routes of administration foreseen in this 
application include previously described methods of encapsulation, emulsion, receptor or 
membrane fusion mediated uptake and methods of direct permeabilization or insertion of 
the DNA or corresponding RNA into the host cells. 

The following examples are included to demonstrate preferred embodiments of 
the invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques discovered by the inventors 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 
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EXAMPLES 
Example 1 : Identification of patatin as an allergen 



Since patatin is commonly obtained from an allergenic source (potato), it was 
hypothesized that patatins in fact encode an important class of offending potato allergens 
(patatin was reported as allergenic by Seppala, U. et al, J. Allergy Clin. Immunol. 103: 
165-171, 1999). Assessment of potential allergens preferably include appropriate in vitro 
testing for IgE binding, in this case with potato allergic sera (Fuchs, R.L. and Astwood, 
J.D., Food Technology, 50: 83-88, 1996; Astwood, J.D., et al. Monographs in allergy 
Vol. 32: Highlights in food allergy, pp. 105-120, 1996, Metcalfe, D.D., et al.. Critical 
Reviews in Food Science and Nutrition, 36S: 165-186, 1996). It is the recommendation 
of a working group organized by the IFBC and the ILSI Allergy and Immunology 
Institute that proteins encoded by nucleic acid sequences from allergenic sources such as 
potato (a "less-commonly" allergenic source) should be examined for their ability to react 
with IgEs of potato-allergic patients using a minimum of five individual patient sera 
(Metcalfe, D.D., et al.. Critical Reviews in Food Science and Nutrition, 36S: 165-186, 
1996). Patatin- 17 protein was tested for IgE binding using standard in vitro testing with 
serum taken from patients with bona fide well defined clinically displayed potato allergy 
as described below. 

Clinical Characterization of Potato Allergic Subjects (Serum donors) 

Patients who suffer from potato allergy were identified at Johns Hopkins Clinic 
(Baltimore, MD) and were evaluated for potato allergy using clinical criteria outlined in 
Table 2. 

Serum was obtained from patients with convincing clinical history of potato 
allergy. The convincing history was defined as being one or more of the following: a) 
positive potato allergic as evaluated by double-blind placebo-control food challenge b) 
anaphylaix and/or hospitalization due to the consumption of potatoes or c) dramatic skin 
test results. 
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Table 2: Clinical patient data 



Patient 


Clinical History 


Flare/Wheal (Skin prick 
test) 


DBPCFC (potato) 1 


HSOl 


Most recent hospitalization: 10/19/93 

AD, A, AR, FH, MFS, IgE =1397 KIAUa/L 


7/19,4/14,7/17 


Not performed 1 


HS02 


Most recent hospitalization! 6/94 

AD, FH, Latex (+) RAST, MFS, lgE=7544K/L 


20/26 


Not performed 1 


HS03 


AD, A, FH, MFS, IgE = N/A 


5/13 


Yes 


HS05 


Most recent hospitalization 5/30/95 
AD, A, FH, MFS, IgE = 12341 ng/ml 


4/9 


Yes 


HS06 


Most recent hospitalization 6/13/95 
AD,AFH, MFS IgE = N/A 


5/20,4/13,5/12 


Yes 


HS07 


Not potato allergic, allergic to egg, milk, 
peanuts, seafood. AD, A, AR, FH, MFS 


High IgE control serum, not allergic to potato. 


HS08 


Non-atopic (normal) 


Low IgE control serum 



AD= Atopic dermatitis; FH= Food hypersensitivity; AR = Allergic rhinitis; A= 
Asthma; MFS= Multiple food sensitivity; N/A = not available. 

Example 2: Westem blotting of patatin proteins 

Western blotting experiments were performed using patatin protein purified to 
near homogeneity from com plants genetically engineered to produce patatin, patatin 
producing crude genetically engineered com leaf extracts, cmde potato tuber extracts, 
and non-transgenic com leaf samples. 

Protein samples were electrophoresed by SDS-PAGE (Laemmli, U.K., Nature 
227: 680-685, 1970) and were electroblotted onto nitrocellulose. Protein blots were 
processed by standard Westem blotting (immunoblotting) techniques and were incubated 
in potato allergic serum diluted 1 :5 in PBS buffer for 1 hour. After washing the blots 3 
times with PBS, the blots were incubated in biotinylated anti-IgE (Johns Hopkins 
Hospital, Baltimore, MD) for 1 hour, followed by a 30 minute incubation in HRP-linked 
avidin(Promega, New York, NY). IgE-reactive protein bands were visualized by DAB 
staining (3,3 diaminobenzidine). The blots were dried and photographed. Individual 
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blots are labeled according to patient serum used. As a control, one blot was incubated in 
anti-IgE only. 

Patatins were shown to be an allergen of potato by examining the reactivity of 
purified patatin to sera obtained from patients allergic to potato. Sera from five potato 
allergic subjects were tested by Western blotting techniques. All five sera reacted with 
purified patatin protein. 

Patatin isozymes (SEQ ID NOS:278-282, Figure 1) were tested for IgE binding 
by Western blotting. Isozymes of patatin were cloned into a yeast expression system and 
purified prior to analysis. The isozymes were subjected to IgE western blotting as 
described above with the exception that all five patient sera were pooled. The resulting 
Western blot of the yeast-expressed isozymes showed that all five isozymes bound IgE in 
a manner similar to patatin 17, and that all isozymes of patatin tested are also allergens. 

Example 3 : Western blotting of patatin proteins 

Eighty-nine 1 0-mer peptides were synthesized using the Genosys SPOTs system, 
each consecutive 1 0-mer overlapping by 6 amino acids based on the amino acid sequence 
of patatin 17 (SEQ ID N0:2). The peprides were evaluated for IgE binding with five 
different potato allergic patient sera using the same incubation procedures as described 
above. The results are summarized graphically in Figure 2, showing major and minor 
allergenic epitopes. Interestingly, many of the immunogenic epitopes contain tyrosine. 
The peptide numbers, sequences, and immunoreactivity is detailed in Table 3. 



Table 3: Peptide scan of patatin 17 



[Peptide # 
(SEQ ID 
NO) 


Peptide Sequence 


HSOl 


HS02 


HS03 


HS05 


HS06 


Cumulative 1 
Total 


1 (16) 


QLGEMVTVLS 


0.47 


0.33 


0.02 


0.05 


0.06 


0.93 


2(17) 


MVTVLSIDGG 


0.53 


0.33 


0.02 


0.07 


0.05 


1 


3(18) 


LSIDGGGIRG 


0.52 


0.38 


0.07 


0.08 


0.09 


1.14 


4(19) 


GGGIRGIIPA 


0.53 


0.19 


0.06 


0.19 


0.23 


1.2 


5(20) 


RGIIPATILE 


0.46 


0.28 


0.04 


0.09 


0.05 


0.92 


6(21) 


PATILEFLEG 


0.49 


0.31 


0.05 


0.09 


0.07 


1.01 


7(22) 


LEFLEGQLQE 


0.36 


0.24 


0.04 


0.1 


0.06 


0.8 


8(23) 


EGQLQEMDNN 


0.29 


0.19 


0.02 


0.09 


0.05 


0.64 


9(24) 


QEMDNNADAR 


0.22 


0.13 


0.01 


0.05 


0.04 


0.45 


1 10(25) 


NNADARLADY 


0.21 


0.17 


0.03 


0.05 


0.07 


0.53 
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1 11(26) 


ARLADYFDVI 


0.54 


0.31 


0.16 


0.15 


0.25 


1.41 


12(27) 


DYFDVIGGTS 


0.61 


0.34 


0.46 


0.06 


0.15 


1.62 


13(28) 


VIGGTSTGGL 


0.63 


0.72 


0.05 


0.15 


0.09 


1.64 


14(29) 


TSTGGLLTAM 


0.3 


0.17 


0.03 


0.06 


0.09 


0.65 


15(30) 


GLLTAMISTP 


0.63 


0.41 


0.05 


0.24 


0.12 


1.45 


16(31) 


AMISTPNENN 


0.34 


0.18 


0.02 


0.07 


0.02 


0.63 


17(32) 


TPNENNRPFA 


0.46 


0.22 


0.03 


0.19 


0.07 


0.97 


18(33) 


NNRPFAAAKE 


0.37 


0.21 


0.05 


0.07 


0.06 


0.76 


19(34) 


FAAAKEIVPF 


0.52 


0.29 


0.08 


0.11 


0.08 


1.08 


20 (35) 


KEIVPFYFEH 


0.29 


0.14 


0.28 


0.29 


0.23 


1.23 


21 (36) 


PFYFEHGPQI 


0.65 


0.06 


1.08 


0.51 


0.17 


2.47 


22 (37) 


EHGPQIFNPS 


0.34 


0.15 


0.03 


0.05 


0.06 


0.63 


23 (38) 


QIFNPSGQIL 


0.33 


0.29 


0.02 


0.07 


0.07 


0.78 


24 (39) 


PSGQILGPKY 


0 


0 


0.02 


0 


0.05 


0.07 


25 (40) 


ILGPKYDGKY 


0 


0 


0.07 


0 


0.02 


0.09 


26 (41) 


KYDGKYLMQV 


0.02 


0 


0.11 


0.01 


0.04 


0.18 


27 (42) 


KYLMQVLQEK 


0.12 


0.04 


1.08 


0.07 


0.79 


2.1 


28 (43) 


QVLQEKLGET 


0.46 


0.16 


0.01 


0.07 


0.02 


0.72 


29 (44) 


EKLGETRVHQ 


0.5 


0.12 


0.01 


0.07 


0.04 


0.74 


30 (45) 


ETRVHQALTE 


0.42 


0.16 


0.03 


0.05 


0.03 


0.69 


31 (46) 


HQALTEVVIS 


0.43 


0.21 


0.04 


0.1 


0.05 


0.83 


32 (47) 


TEVVISSFDI 


0.44 


0.25 


0.05 


0.08 


0.04 


0.86 


33 (48) 


ISSFDIKTNK 


0.1 


0.02 


0.04 


0.06 


0.13 


0.35 


34 (49) 


DIKTNKPVIF 


0.57 


0.22 


0.04 


0.18 


0.28 


1.29 


35 (50) 


NKPVIFTKSN 


0 


0.01 


0.02 


0.07 


0.24 


0.34 


36(51) 


IFTKSNLANS 


0 


0 


0.03 


0.06 


0.17 


0.26 


37 (52) 


SNLANSPELD 


0.43 


0.96 


0.01 


0.09 


0.02 


1.51 


38 (53) 


NSPELDAKMY 


0.18 


0.12 


0.01 


0.05 


0.05 


0.41 


39 (54) 


LDAKMYDISY 


0.54 


0.26 


0.19 


0.15 


0.23 


1.37 


40 (55) 


MYDISYSTAA 


0.92 


0.08 


0.52 


0.04 


0.22 


1.78 


41 (56) 


SYSTAAAPTY 


1.15 


0.25 


1.04 


0.33 


0.55 


3.32 


42 (57) 


AAAPTYFPPH 


1.02 


0.52 


1.12 


0.81 


0.86 


4.33 


43 (58) 


TYFPPHYFVT 


0.02 


0.01 


0.54 


0.03 


0.24 


0.84 


44 (59) 


PHYFVTNTSN 


0.03 


0.01 


1.17 


0.13 


0.44 


1.78 


45 (60) 


VTNTSNGDEY 


0.23 


0.15 


0.04 


0.03 


0.03 


0.48 


46(61) 


SNGDEYEFNL 


0.33 


0.25 


0.08 


0.1 


0.11 


0.87 


47(62) 


EYEFNLVDGA 


0.34 


0.25 


0.07 


0.1 


0.2 


0.96 


48 (63) 


NLVDGAVATV 


0.3 


0.18 


0.02 


0.06 


0.05 


0.61 


49(64) 


GAVATVADPA 


0.45 


0.54 


0.01 


0.07 


0.02 


1.09 


50 (65) 


TVADPALLSI 


0.48 


0.29 


0.01 


0.07 


0.03 


0.88 


51 (66) 


PALLSISVAT 


0.65 


0.33 


0.02 


0.1 


0.01 


1.11 


52 (67) 


SISVATRLAQ 


0.61 


0.23 


0.14 


0.53 


0.53 


2.04 


53 (68) 


ATRLAQKDPA 


0.87 


0.34 


0.05 


0.29 


0.22 


1.77 


54(69) 


AQKDPAFASI 


0.86 


0.32 


0.04 


0.12 


0.03 


1.37 


55 (70) 


PAFASIRSLN 


0.81 


0.15 


0.05 


0.51 


0.59 


2.11 


56(71) 


SIRSLNYKKM 


0.07 


0.01 


0.17 


0.07 


0.11 


0.43 


57 (72) 


LNYKKMLLLS 


0.05 


0.01 


0.35 


0.08 


0.39 


0.88 


58 (73) 


KMLLLSLGTG 


1.15 


0.15 


0.04 


0.38 


0.71 


2.43 


59(74) 


LSLGTGTTSE 


0.34 


0.23 


0.02 


0.04 


0.03 


0.66 


60(75) 


TGTTSEFDKT 


0.92 


0.39 


0.6 


0.1 


0.09 


2.1 


61 (76) 


SEFDKTYTAK 


1.33 


1.35 


1.41 


0.12 


0.28 


4.49 


1 62(77) 


KTYTAKEAAT 


1.36 


0.94 


1.11 


0.76 


0,4 


4.57 
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63 (78) 


AKEAATWTAV 


0.45 


0.15 


0.01 


0.2 


0.04 


0.85 1 


64(79) 


ATWTAVHWML 


0.1 


0.02 


0.01 


0.08 


0.06 


0.27 


65 (80) 


AVHWMLVIQK 


0.69 


0.05 


0.03 


0.43 


0.62 


1.82 


66 (81) 


MLVIQKMTDA 


0.32 


0.15 


0.02 


0.15 


0.03 


0.67 i 


67 (82) 


QKMTDYYLST 


0.26 


0.125 


0.03 


0.21 


0.05 


0.675 


68 (83) 


DAASSYMTDY 


0.2 


0.14 


0.08 


0.08 


0.1 


0.6 


69 (84) 


SYMTDYYLST 


0.5 


0.03 


0.32 


0.06 


0.11 


1.02 


70 (85) 


DYYLSTAFQA 


0.14 


0 


0.22 


0.03 


0.13 


0.52 


71 (86) 


STAFQALDSK 


0.4 


0.3 


0.04 


0.06 


0.08 


0.88 


72 (87) 


QALDSKNNYL 


0.44 


0.46 


0.28 


0.26 


0.43 


1.87 


73 (88) 


SKNNYLRVQE 


0.44 


0.05 


1.31 


0.07 


0.21 


2.08 


74 (89) 


YLRVQENALT 


1.38 


0.03 


1.31 


0.11 


0.2 


3.03 


75 (90) 


QENALTGTTT 


0.47 


0.25 


0 


0.06 


0 


0.78 


76 (91) 


LTGTTTEMDD 


0.41 


0.24 


0 


0.06 


0 


0.71 


77 (92) 


TTEMDDASEA 


0.38 


0.3 


0 


0.05 


0 


0.73 


78 (93) 


DDASEANMEL 


0.44 


0.24 


0 


0.06 


0 


0.74 


79 (94) 


EANMELLVQV 


0.42 


0.27 


0 


0.04 


0 


0.73 


80 (95) 


ELLVQVGENL 


0.4 


0.25 J 


0 


0.05 


0 


0.7 


81 (96) 


QVGENLLKKP 


0.44 


0.14 


0 


0.07 


0 


0.65 


82 (97) 


NLLKKPVSED 


0.47 


0.2 


0 


0.03 


0 


0.7 


83 (98) 


KPVSEDNPET 


0.27 


0.21 


0 


0.03 


0 


0.51 


84 (99) 


EDNPETYEEA 


0.13 


0.11 


0 


0.01 


0 


0.25 


85(100) 


ETYEEALKRF 


1.26 


1.2 


1.36 


0.53 


0.71 


5.06 


86(101) 


EALKRFAKLL 


1.38 


0.04 


0 


1.06 


1.12 


3.6 


87 (102) 


RFAKLLSDRK 


0.98 


0.05 


0 


0.84 


0.94 


2.81 


88(103) 


LLSDRKKLRA 


0.2 


0.01 


0 


0.37 


0.51 


1.09 i 


89(104) 


RKKLRANKAS 


0.28 


0 


0 


0.31 


0.64 


1.23 




















Patient 
Cumulative Totals 


41.84 


20.565 


18.1 


14.17 


16.55 





2 Example 4: Identification of result effective substitutions 

3 For each major and several minor allergenic epitopes of patatin, result effective 

4 substitutions were identified by synthesizing peptides that were altered by individually 

5 substituting an alanine residue at each non-alanine position in the epitope. Similarly, the 

6 reported nucleic acid sequence encoding com patatin (U.S. Patent No. 5,882,668; clone 

7 5c9) was evaluated for IgE binding by producing peptides at corresponding positions to 

8 the potato patatin protein. 

9 For example, Epitope 41 was analyzed by alanine scanning and rational 

10 substitution as follows. 

11 Epitope 41 SEFDKTYTAK (SEQ ID NO: 76) 

12 Alanine scan AEFDKTYTAK (SEQ ID NO: 16 5) 
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1 




SAFDKTYTAK 


(SEQ 


ID 


NO 


166) 


2 




SEADKTYTAK 


(SEQ 


ID 


NO 


167) 


3 




SEFAKTYTAK 


(SEQ 


ID 


NO 


168) 


4 




SEFDATYTAK 


(SEQ 


ID 


NO 


169) 


5 




SEFDKAYTAK 


(SEQ 


ID 


NO 


170) 


6 




SEFDKTATAK 


(SEQ 


ID 


NO 


171) 


7 




SEFDKTYAAK 


(SEQ 


ID 


NO 


172) 


8 




SEFDKTYTAA 


(SEQ 


ID 


NO 


173) 


9 


Rational substitution 


AFFDKTYTAK 


(SEQ 


ID 


NO 


283) 


10 




SEFDKTFTAK 


(SEQ 


ID 


NO 


176) 


11 


Com homolog 


CIFDSTYTAK 


(SEQ 


ID 


NO 


284) 



13 Selected epitopes were analyzed by alanine scanning and rational substitution. 

14 Immunoassay with potato-allergic serum was used as described above. Table 4 

15 summarizes the results of these experiments to identify result effective substitutions for 

16 patatin. Blank spaces in the table indicate that binding of the peptide to patient IgE was 

17 not detectable. 

18 Table 4: Scaiming of patatin for resuh effective substitutions 







Binding of modified peptides by patient IgE as measured by OD || 


Sequence 


SEQ ID NO 


HS03 


HS06 


HSOl 


HS02 II 


DYFDVIGGTS 


105 




0.12 


0.16 


0.36 


DYFDVIAGTS 


106 




0.14 


0.17 


0.4 


VIGGTSTGGL 


107 








0.04 


VIAGTSTGAL 


108 










AFYFEHGPQl 


109 




0.96 


0.5 


0.78 


PAYFEHGPQI 


110 




0.75 


0.41 


0.69 


PFAFEHGPQI 


111 










PFYAEHGPQI 


112 




0.7 


0.43 


0.79 


PFYFAHGPQl 


113 


0.93 


1.07 


0.59 


1.44 


PFYFEAGPQl 


114 


0.08 


0.93 


0.65 


1.34 


PFYFEHAPQI 


115 




0.75 


0.54 


1.11 


PFYFEHGAQI 


116 




0.63 


0.29 


0.6 


PFYFEHGPAI 


117 




0.63 


0.25 


0.56 


PFYFEHGPQA 


118 




0.27 


0.16 


0.33 


TFYLENGPKI 


119 


0.05 


0.48 


0.68 


1.07 


PFFFEHGPQI 


120 










AYLMQVLQEK 


121 




0.26 


0.11 


0.53 


KALMQVLQEK 


122 
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KYAMQVLQEK 


123 


0.11 


I 0.43 


0.1 


1.25 1 


KYLAQVLQEK 


124 


0.22 


0.48 


0.11 


1.34 


KYLMAVLQEK 


125 


0.22 


0.83 


0.16 


1.33 


KYLMQALQEK 


126 


0.11 


0.6 


0.15 


0.95 


KYLMQVAQEK 


127 




0.53 


0.15 


0.81 


KYLMQVLAEK 


128 


0.06 


0.69 


0.11 


1.34 1 


KYLMQVLQAK 


129 


0.74 


0.79 


0.05 


0.58 


KYLMQVLQEA 


130 




0.28 


0.27 


0.37 


VFLHDKIKSL 


131 


0.06 


0.26 




0.41 


Uystaaapty 


132 




0.1 


0.12 


0.12 


SASTAAAPTY 


133 










syataaapty 


134 




0.16 


0.13 


0.37 


SYSAAAAPTY 


135 




0.13 


0.12 


0.32 


SYSTAAAATY 


136 




0.15 


0.13 


0.34 


SYSTAAAPAY 


137 




0.15 


0.14 


0.29 


SYSTAAAPTA 


138 




0.55 


0.54 


1.13 


CISTSAAPTY 


139 


0.4 








SYSTAAAPAF 


140 


0.39 


1.02 


0.65 


1.42 


AFAAAAAPTY 


141 








0.07 


SYSTAAAPTF 


142 


0.15 


0.97 


0.48 


1.09 


STSAAPTYFP 


143 




0.21 


0.23 


0.39 


STSAAPTFFP 


144 








0.23 


STSAAPTAFP 


145 








0.08 


STAAAPTFFP 


146 






0.12 


0.28 


AAAATYFPPH 


147 




0.13 


0.1 


0.05 


AAAPAYFPPH 


148 






0.07 


0.04 


jAAAPTAFPPH 


149 










AAAPTYAPPH 


150 




0.23 


0.14 


0.21 


AAAPTYFAPH 


151 




0.45 


0.18 


0.44 


Uaaptyfpah 


152 




0.15 


0.07 


0,18 


[aaaptyfppa 


153 




0.1 


0.06 


0.31 


saaptyfpah 


154 




0.77 


0.73 


0.96 


Uaapaffpph 


155 










aaappffpph 


156 










AAAPTFFPPH 


157 










SISVATRLAQ 


158 






0.26 


0.26 


AMSMLTKEVH 


159 










jPAFASIRSLN 


160 










PNFNAGSPTE 


161 










KMLLLSLGTG 


162 










nyliisvgtg 


163 


0.49 


1.08 


0.64 


1.48 


KMLLLSLGAG 


164 




0.13 






Iaefdktytak 


165 


0.09 


0.22 




1.34 


SAFDKTYTAK 


166 


0.66 


0.71 


0.06 


1.42 


SEADKTYTAK 


167 








0.99 


SEFAKTYTAK 


168 


0.5 


0.57 




0.91 1 


SEFDATYTAK 


169 








0.17 


SEFDKAYTAK 


170 


0.1 


0.24 




1.38 


SEFDKTATAK 


171 








0.81 


ISEFDKTYAAK 


172 


0.2 


0.35 




1.39 


ISEFDKTYTAA 


173 






0.1 


1.18 


Ikqaekytaeq 


174 






0.08 


0.24 1 
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SEFDAAFAAA 


175 










SEFDKTFTAK 


176 


0.09 


0.16 


0.07 


1.45 


AEKYTAEQCA 


177 










ATYTAKEAAT 


178 




0.24 




0.18 


KAYTAKEAAT 


179 




0.28 




0.33 


KTATAKEAAT 


180 










KTYAAKEAAT 


181 


0.1 


0.32 




0.73 


KTYTAAEAAT 


182 








0.35 


KTYTAKAAAT 


183 


0.4 


0.59 




0.82 


KTYTAKEAAA 


184 








0.36 


EKYTAEQCAK 


185 










AAFAAAEAAT 


186 










KTFTAKEAAT 


187 










QALHCEKKYL 


188 










QALDSKAAYL 


189 










QALDSKNNFL 


190 










QALHCENNFL 


191 










CEKKYLRIQD 


192 


1.01 


0.16 






SKNNFLRVQE 


193 










SENNYLRVQE 


194 


0.31 


0.96 


0.42 


1 


ALRVQENALT 


195 










YARVQENALT 


196 


1.06 


1.02 


0.05 


0.54 


YLAVQENALT 


197 


0.37 


1.04 


0.11 


1.06 


YLRAQENALT 


198 


1.1 


1 


0.06 


1.26 


YLRVAENALT 


199 


1.03 


0.92 


0.08 


1.26 


YLRVQANALT 


200 


1.05 


0.92 


0.06 


1.24 


YLRVQEAALT 


201 


0.93 


0.92 


0.07 , 


1.11 


YLRVQENAAT 


202 


0.94 


0.93 


0.04 


1.24 


YLRVQENALA 


203 


1.05 


0.96 


0.43 


1.16 


YLRIQDDTLT 


204 


1.07 


0.85 


0.39 


1.12 


YLTVAAAALT 


205 


1.05 


0.86 


0.28 


1.33 


FLRVQENALT 


206 










NNYLRVQENA 


207 


0.23 


0.88 


0.5 


1.17 


KKYLRIQDDT 


208 




0.26 


0.09 


0.37 


NNFLRVQENA 


209 










NAYLRVQENA 


210 


0.17 


1.02 


0.53 


1.06 


ATYEEAKLRF 


211 


0.26 


1.03 




0.65 


EAYEEALKRF 


212 


0.06 


0.43 




0.33 


ETAEEALKRF 


213 




1.04 






ETYAEALKRF 


214 


0.62 


1.02 




1.15 


ETYEAALKRF 


215 


1.06 


0.38 




0.89 


ETYEEAAKRF 


216 


0.08 


O.I 




0.9 


ETYEEALARF 


217 








0.11 


ETYEEALKAF 


218 








0.1 


ETYEEALKRA 


219 








0.1 


GTNAQSLADF 


220 










ETYEAALAAF 


221 


0.07 


0.78 


0.33 


0.77 


ETFEEALKRF 


222 










YEEALKTFAK 


223 


1.08 


0.85 


0.14 


1.46 


FEEALKRFAK 


224 


0.46 


0.72 




0.67 


|aalkrfakll 


225 


0.15 


0.17 






IIeaakrfakll 


226 


0.08 


0.33 




0.05 1 
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IIEALARFAKLL 


227 




0.09 




1 


EALKAFAKLL 


228 










EALKRAAKLL 


229 


0.08 


0.07 






EALKRFAALL 


230 










EALKRFAKAL 


231 


0.06 


0.09 




0.1 


EALKRFAKLA 


232 


0.06 






0.1 


QSLADFAKQL 


233 










AALAAFAKLL 


234 










LADFAKQLSD 


235 










DFAKQLSDER 


236 








0.17 1 


AFAALLSDRK 


237 











Result effective substitutions were identified by a reduction in IgE binding ability 
with respect to the non-substituted peptide sequence. Table 5 shows the identified result 
effective substitutions. Blank spaces in the table indicate that binding of the peptide to 
patient IgE was not detectable. Many substitutions of alanine or phenylalanine for the 
original tyrosine resulted in reduced or eliminated antibody binding. 



Table 5: Result effective substitutions of patatin 



Location 


Peptide 


(SEQ ID NO) 


HS03 


HS06 


HSOl 


HS02 1 


(SEQ ID NO) 






























Minor 


PFYFEHGPQI 


(36) 


1 


08 


0 


17 


0 


65 


0 


06 1 


Epitope 21 




A 
P 








(111) 
(r) (120) 




























A 


(118) 






0 


27 


0 


16 


0 


33 


Minor 


KYLMQVLQEK 


(42) 


1 


08 


0 


79 


0 


12 


0 


04 


Epitope 27 


:A- 








(122) 
(130) 
(C) (131) 






























0 
0 


28 
26 


0 


27 


0 
0 


37 
41 




VFLHDKIKSL 


0 


06 


Major 


SYSTAAAPTY 


(56) 


1 


04 


0 


55 


1 


15 


0 


25 


Epitope 41 












(132) 
(133) 
(r) (141) 
(c) (139) 






0 


. 1 


0 


12 


0 


12 
















AFAA- • • 


















0. 


007 




CI: :S: : 






0. 


04 










Overlap 


STAAAPTYFP 


(238) 


















Epitope 
41/42 




S 


: : -A 




(r) (145) 














0 


08 


























Major 


AAAPTYFPPH 


(57) 


1 


12 


0 


86 


1 


02 


0 


52 1 


Epitope 4 2 






A: : 






(148) 










0 


07 


0 


04 1 


(57) 






:A: 
AF: 
PF: 
:F: 






(149) 
(r) (155) 
(r) (156) 
(r) (157) 


















Major 


SEFDKTYTAK 


(76) 


0 


12 


0 


28 


1 


33 


1 


35 


Epitope SI 






A: : 






(169) 














0 


17 1 




























KQAE : YTAEQ 


(c) (174) 










0 


08 


0 


.24 








AAFA 


A 


(r) (175) 


















II Major 


KTYTAKEAAT 


(77) 


1 


.11 


0 


04 


1 


36 


0 


.94 1 
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Minor 
Epitope 72 



Minor 
epitope 73 

I Minor 
epitope 74 

Overlap 
epitope 

73/74 

Major 
epitope 85 



Major 
Epitope 86 



Epitope 
overlap 
86/87 
Major 
Epitope 87 



QALDSKNNYL 
:HCEKK: : 
: : : :AA: : 



(178) 
(180) 
(182) 
(r) (186) 
(r) (187) 
(c) (185) 

(87) 
(c) (188) 
(r) (189) 
(r) (190) 
(r) (240) 



SKNNYLRVQE 



YLRVQENALT 



NNYLRVQENA 



(88) 
(r) (193) 
(87) 
(195) 
(r) (206) 
(207) 
(r) (209) 



ETYEEALKRF 



F: :::::: (r! 
G:NAQS:AD: (c 
EALKRFAKLL 



(100) 
(217) 
(218) 
(219) 
) (222) 
) (220) 



(101) 
(227) 
(228) 
(229) 
(230) 
(231) 
(232) 
C) (241) 
r) (234) 
LKRFAKLLSD (23 9) 

(NO BINDING) 

RFAKLLSDRK (102) 

:Q: : : ER (c) (236) 

A: : : : : : (r) (237) 

rational; (c) = corn. 



0.06 
0.06 



1.12 
0.09 



0.07 
0.09 



Example 5: Site directed mutagenesis 

To introduce site specific mutations, the cloned DNA sequence of patatin (SEQ 
ID N0:1 encoding patatin protein SEQ ID N0:2; pMON 26820) was subjected to PGR 
with primers SEQ ID NO: 3 and SEQ ID N0:4 to incorporate part of the a-factor signal 
sequence {Pichia expression manual, Invitrogen, Carlsbad, CA), and EcoRJ and Xhol 
restriction sites to facilitate cloning into the Pichia pastoris yeast secretion vector pPIC9 
(GenBank accession number Z46233; Invitrogen, Carlsbad, CA). Typical PCR 
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conditions are 25 cycles 94°C denaturation for 1 minute, 45°C annealing for one minute 
and 72°C extension for 2 minutes; plus one cycle IT'C extension for 10 minutes. A 50 
|iL reaction contains 30 pmol of each primer and 1 |ig of template DNA; and 1 X PCR 
buffer with MgC12, 200 \M dGTP, 200 dATP, 200 dTTP, 200 dCTP, 2.5 
units of Pwo DNA polymerase. PCR reactions are performed in RoboCycler Gradient 96 
Temperature Cycler (Stratagene, La Jolla, CA). 

The amplified iragment SEQ ID N0:5 was digested with restriction enzymes 
Xhol and EcoRI and cloned into the pBluescript vector (Stratagene, La Jolla, CA), 
digested with the same two restriction enzymes. The resulting plasmid (pMON 26869) 
was used for oligonucleotide-directed mutagenesis using the Bio-Rad mutagenesis kit 
based on the method of Kunkel {Proc. Natl. Acad. Sci. U.S.A., 82: 477-492, 1985). 
Briefly, single-stranded pMON26869 was used as template for mutagenesis and was 
prepared by superinfection of plasmid containing cells with M13K07 (Gorman, et al., 
DNA Prot. Eng. Techniques, 2: 3-10, 1990). The mutagenic oligonucleotides are SEQ ID 
NOS:8-15 (reverse complement). DNA purified fi-om transformed DH5a E. coli colonies 
was used for sequence determination. Sequencing was performed using the ABI PRISM 
sequencing kit (Perkin Elmer Biosystems, Foster City, CA). The resulting plasmid 
containing the mutation in the patatin gene was digested with restriction enzymes Xhol 
and EcoRI. 

The patatin nucleic acid Augment was then ligated into the pPIC9 vector 
(Invitrogen, Carlsbad, CA), digested with the same two restriction enzymes to afford 
plasmid pMON37401. Pichia pastoris KM71 cells were electroporated with 
pMON37401 containing the appropriate mutation. The resulting transformed cells were 
used to produce protein in Pichia pastoris using the procedure supplied by the 
manufacturer (Invitrogen, Carlsbad, CA). The encoded protein contains an alpha factor 
signal cleavage site. Plasmid pMON37401 encodes SEQ ID N0:6 which is cleaved to 
afford SEQ ID N0:7, having four amino acids added at the N-terminus of amino acids 
24-386 of SEQ ID N0:2. Position four of SEQ ID N0:7 therefore corresponds to 
position 23 of SEQ ID N0:2. 

The concentration of patatin in the culture was determined using a patatin ELISA 
assay and the enzyme activity was measured using the method of Hofgen and Willraitzer 
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(Plant Science, 66: 221-230, 1990). The variants containing multiple mutations were 
further purified using Mono Q and hydrophobic interaction chromatography (HIC). Each 
culture was purified by first sizing on Amicon YMIO membranes (Millipore, Bedford, 
MA) to a >10 kDa fraction, followed by chromatography on the Mono Q HR 10/10 
column (Pharmacia, Piscataway, NJ). For chromatography on the Mono Q column, the 
samples were loaded on the column in 25 mM Tris pH 7.5 and eluted with a gradient of 
1.0 M KCl in 25 mM Tris pH 7.5. Fractions containing patatin protein were determined 
using SDS-PAGE. For chromatography on the HIC colimm, the appropriate fi-actions 
were pooled and dialyzed into 1 M ammonium sulfate in 25 mM Tris pH 7.5. The 
dialyzed sample was then loaded on 16/10 phenyl Sepharose column (Pharmacia, 
Piscataway, NJ) and eluted with a gradient of 25 mM Tris pH7.5. 

The protein concentration was determined using the Bradford method, using BSA 
as a standard. SDS-PAGE analysis showed that these proteins were essentially pure. The 
esterase activity of the newly formed variants are shown in Table 6. The activity was 
determined using p-nitrophenyl caprate substrate as described by Hofgen and Willmitzer 
(Plant Science, 66: 221-230, 1990). 



Table 6: Esterase activity of patatin mutants 



Variant 


Activity (mOD.min'Vg^) 


Wild type 


93.2 


Y106F 


51.1 


Y129F 


74.7 


Y185F 


85.6 


Y193F 


82.2 


Y185F/Y193F 


99.4 


Y270F 


163.4 


Y316F 


94.88 


Y362F 


130.7 


Y106F/Y129F/Y185F/Y193F/Y270F/Y316F/Y362F 


57.1 


Yl 85F/Y 1 93F/Y270F/Y3 1 6F/Y362F 





Patatin proteins having a phenylalanine substitution at each of the amino acid 
positions 106, 129, 185, 193, 270, 316 and 362 (numbers correspond to positions in SEQ 
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ID N0:2) of expressed SEQ ID N0:7 exhibit full enzyme activity. Proteins having 
muhiple substitutions also displayed full enzyme activity. 

In addition to nucleotide sequences encoding conservative amino acid changes 
within the fundamental polypeptide sequence, biologically functional equivalent 
nucleotide sequences include nucleotide sequences containing other base substitutions, 
additions, or deletions. These include nucleic acids containing the same inherent genetic 
information as that contained in the cDNA which encode peptides, polypeptides, or 
proteins conferring pathogen resistance the same as or similar to that of pathogen upon 
host cells and plants. Such nucleotide sequences can be referred to as "genetically 
equivalent modified forms" of the cDNA, and can be identified by the methods 
described herein. 

Mutations made in the cDNA, plasmid DNA, genomic DNA, synthetic DNA, or 
other nucleic acid encoding the deallergenized gene preferably preserve the reading 
fi-ame of the coding sequence. Furthermore, these mutations preferably do not create 
complementary regions that could hybridize to produce secondary mRNA structures, 
such as loops or hairpins, that would adversely affect mRNA translation. 

Although mutation sites can be predetermined, it is not necessary that the nature 
of the mutations per se be predetermined. For example, in order to select for optimum 
characteristics of mutants at a given site, random mutagenesis can be conducted at the 
target codon. 

Alternatively, mutations can be introduced at particular loci by synthesizing 
oligonucleotides containing a mutant sequence, flanked by restriction sites enabling 
ligation to fragments of the native cDNA sequence. Following ligation, the resulting 
reconstructed nucleotide sequence encodes a derivative form having the desired amino 
acid insertion, substitution, or deletion. 

Example 6: Construction of permutein sequences 

Nucleic acid sequences encoding permutein proteins having rearranged N- 
terminus/C-terminus protein sequences can be made by following the general method 
described by MuUins et al. (J. Am. Chem. Soc. 116: 5529-5533, 1994). The steps are 
shown in Figure 3. The Figure and the following Examples involve the design and use of 
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1 a linker region separating the original C-terminus and N-terminus, but the use of a linker 

2 is not a critical or required element of permutein design. 

3 Two sets of oligonucleotide primers are used in the construction of a nucleic acid 

4 sequence encoding a permutein protein. In the first step, oligonucleotide primers "new 

5 N-termini" and "linker start" are used in a PCR reaction to create amplified nucleic acid 

6 molecule "new N-termini fragment" that contains the nucleic acid sequence encoding the 

7 new N-terminal portion of the permutein protein, followed by the polypeptide linker that 

8 connects the C-terminal and N-terminal ends of the original protein. In the second step, 

9 oligonucleotide primers "new C-termini" and "linker end" are used in a PCR reaction to 

10 create amplified nucleic acid molecule "new C-termini fragment" that contains the 

1 1 nucleic acid sequence encoding the same linker as used above, followed by the new C- 

12 termini portion of the permutein protein. The "new N-termini" and "new C-termini" 

13 oligonucleotide primers are designed to include appropriate restriction enzyme 

14 recognition sites which assist in the cloning of the nucleic acid sequence encoding the 

15 permutein protein into plasmids. 

16 Any suitable PCR conditions and polymerase can be used. It is desirable to use a 

17 thermostable DNA polymerase with high fidelity to reduce or eliminate the introduction 

18 of sequence errors. Typical PCR conditions are 25 cycles 94°C denaturation for 1 

19 minute, 45°C armealing for one minute and 72°C extension for 2 minutes; plus one cycle 

20 72°C extension for 10 minutes. A 50 reaction contains 30 pmol of each primer and 1 

21 |ig of template DNA; and 1 X PCR buffer with MgCh, 200 dGTP, 200 \iM. dATP, 

22 200 [iM dTTP, 200 |aM dCTP, 2.5 units of Pwo DNA polymerase. PCR reactions are 

23 performed in RoboCycler Gradient 96 Temperature Cycler (Stratagene, La Jolla, CA). 

24 The amplified "new N-termini fi-agment" and "new C-termini fiagment" are 

25 annealed to form a template in a third PCR reaction to amplify the fiiU-length nucleic 

26 acid sequence encoding the permutein protein. The DNA fragments "new N-termini 

27 fragment" and "new C-termini fragment" are resolved on a 1% TAE gel, stained with 

28 ethidium bromide, and isolated using the QIAquick Gel Extraction Kit (Qiagen, Valencia, 

29 CA). These fragments are combined in equimolar quantities with oligonucleotide 

30 primers "new N-termini" and "new C-termini" in the third PCR reaction. The conditions 
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for the PGR are the same as used previously. PGR reaction products can be purified 
using the QIAquick PGR purification kit (Qiagen, Valencia, CA). 

Alternatively, a linker sequence can be designed containing a restriction site, 
allowing direct ligation of the two amplified PGR products. 

Example 7: Gonstruction of plasmid pMON 37402 

The patatin protein contains a trypsin protease sensitive site at the arginine amino 
acid at position 246, as determined by electrophoresis of a trypsin digest reaction. In 
order to determine if the exposed protease site is an antigenic epitope, a permutein was 
constructed using positions 246-247 as a breakpoint. 

The nucleic acid sequence encoding the permutein protein in plasmid pMON 
37402 was created using the method illustrated in Figure 3 and described in Example 6. 
Nucleic acid molecule "new N-termini fragment" was created and amplified from the 
sequence encoding patatin in plasmid pMON26820 using oligonucleotide primers 27 
(SEQ ID NO:242) and 48 (SEQ ID NO:243). Nucleic acid molecule "new C-termini 
fragment" was created and amplified from the sequence encoding patatin in plasmid 
pMON26820 using oligonucleotide primers 47 (SEQ ID NO:244) and 36 (SEQ ID 
NO:245). The fiiU-length nucleic acid molecule encoding the permutein protein was 
created and amplified firom annealed fragments "new N-termini fragment" and "new G- 
termini fragment" using oligonucleotide primers 27 (SEQ ID NO:242) and 36 (SEQ ID 
NO:245). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Xhol and EcoRI, and purified using the QIAquick PGR purification kit 
(Qiagen, Valencia, GA). Plasmid pMON 26869 (derivative of pPIC9, Invitrogen, 
Garlsbad, GA) was digested with restriction endonucleases Xhol and EcoRI, and gel 
purified, resulting in an approximately 2900 base pair vector fragment. The purified 
restriction fragments were combined and ligated using T4 DNA ligase. 

The ligation reaction mixture was used to transform E. coli sfrain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
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presence of the correct insert. The resuhing plasmid was designated pMON 37402 
(containing SEQ ID NO:246, encoding protein sequence SEQ ID NO:247). 

Example 8: Construction of plasmid pMON 37405 

Amino acids 201-202, near tyrosine 193, were chosen as a breakpoint for the 
construction of a permutein protein. 

The nucleic acid sequence encoding the permutein protein in plasmid pMON 
37405 was created using the method illustrated in Figure 3 and described in Example 6. 
Nucleic acid molecule "New N-termini fragment" was created and amplified fi-om the 
sequence encoding patatin in plasmid pMON26820 using oligonucleotide primers 48 
(SEQ ID NO:243) and 58 (SEQ ID NO:249). Nucleic acid molecule "New C-termini 
fragment" was created and amplified from the sequence encoding patatin in plasmid 
pMON26820 using oligonucleotide primers 47 (SEQ ID NO:244) and 59 (SEQ ID 
NO:249). The full-length nucleic acid molecule encoding the permutein protein was 
created and amplified fi-om annealed fragments "New N-termini fragment" and "New C- 
termini fragment" using oligonucleotide primers 58 (SEQ ID NO:248) and 59 (SEQ ID 
NO:249). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Xhol and EcoRI, and purified using the QIAquick PGR purification kit 
(Qiagen, Valencia, CA). Plasmid pMON 26869 (derivative of pPIC9, Invitrogen, 
Carlsbad, CA) was digested with restriction endonucleases Xhol and EcoRI, and gel 
purified, resulting in an approximately 2900 base pair vector fragment. The purified 
restriction fragments were combined and ligated using T4 DNA ligase. 

The ligation reaction mixture was used to transform E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
presence of the correct insert. The resulting plasmid was designated pMON 37405 
(containing SEQ ID NO:250, encoding protein sequence SEQ ID NO:251). 
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Example 9: Construction of plasmid pMON 37406 



Amino acids 183-184, adjacent to tyrosine 185, were chosen as a breakpoint for 
the construction of a permutein protein. 

The nucleic acid sequence encoding the permutein protein in plasmid pMON 
37406 was created using the method illustrated in Figure 3 and described in Example 6. 
Nucleic acid molecule "New N-termini fragment" was created and amplified from the 
sequence encoding patatin in plasmid pMON26820 using oligonucleotide primers 48 
(SEQ ID NO:243) and 60 (SEQ ID NO:252). Nucleic acid molecule "New C-termini 
fragment" was created and amplified from the sequence encoding patatin in plasmid 
pMON26820 using oligonucleotide primers 47 (SEQ ID NO:244) and 61 (SEQ ID 
NO:253). The full-length nucleic acid molecule encoding the permutein protein was 
created and amplified from annealed fragments "New N-termini fragment" and "New C- 
termini fragment" using oligonucleotide primers 60 (SEQ ID NO:252) and 61 (SEQ ID 
NO:253). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Xhol and EcoRI, and purified using the QIAquick PCR purification kit 
(Qiagen, Valencia, CA). Plasmid pMON 26869 (derivative of pPIC9, Invitrogen, 
Carlsbad, CA) was digested with restriction endonucleases Xhol and EcoRI, and gel 
purified, resulting in an approximately 2900 base pair vector fragment. The purified 
restriction fragments were combined and ligated using T4 DNA ligase. 

The ligation reaction mixture was used to transform E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
presence of the correct insert. The resulting plasmid was designated pMON37406 
(containing SEQ ID NO:254, encoding protein sequence SEQ ID NO:255). 

Example 10: Construction of plasmid pMON 37407 

Amino acids 268-269, adjacent to tyrosine 270, were chosen as a breakpoint for 
the construction of a permutein protein. 
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The nucleic acid sequence encoding the permutein protein in plasmid pMON 

37407 was created using the method illustrated in Figure 3 and described in Example 6. 
Nucleic acid molecule "New N-termini fragment" was created and amplified fix)m the 
sequence encoding patatin in plasmid pMON26820 using oligonucleotide primers 48 
(SEQ ID NO:243) and 62 (SEQ ID NO:256). Nucleic acid molecule "New C-termini 
fragment" was created and amplified from the sequence encoding patatin in plasmid 
pMON26820 using oligonucleotide primers 47 (SEQ ID NO:244) and 63 (SEQ ID 
NO:257). The full-length nucleic acid molecule encoding the permutein protein was 
created and amplified from annealed fragments "New N-termini fragment" and "New C- 
termini fragment" using oligonucleotide primers 62 (SEQ ID NO:256) and 63 (SEQ ID 
NO:257). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Xhol and EcoRI, and purified using the QIAquick PCR purification kit 
(Qiagen, Valencia, CA). Plasmid pMON 26869 (derivative of pPIC9, Invitrogen, 
Carlsbad, CA) was digested with restriction endonucleases Xhol and EcoRI, and gel 
purified, resuUing in an approximately 2900 base pah- vector fragment. The purified 
restriction fragments were combined and ligated using T4 DNA ligase. 

The ligation reaction mixture was used to transform E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
presence of the correct insert. The resulting plasmid was designated pMON37407 
(containing SEQ ID NO:258, encoding protein sequence SEQ ID NO:259). 

Example 1 1 : Construction of plasmid pMON 37408 

Amino acids 321-322, near tyrosine 216, were chosen as a breakpoint for the 
construction of a permutein protein. 

The nucleic acid sequence encoding the permutein protein in plasmid pMON 

37408 was created using the method illustrated in Figure 3 and described in Example 6. 
Nucleic acid molecule "New N-termini fragment" was created and amplified from the 
sequence encoding patatin in plasmid pMON26820 using oligonucleotide primers 48 
(SEQ ID NO:243) and 64 (SEQ ID NO:260). Nucleic acid molecule "New C-termini 
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fragment" was created and amplified from the sequence encoding patatin in plasmid 
pMON26820 using oligonucleotide primers 47 (SEQ ID NO:244) and 65 (SEQ ID 
NO:261). The full-length nucleic acid molecule encoding the permutein protein was 
created and amplified from annealed fragments "New N-termini fragment" and "New C- 
termini fragment" using oligonucleotide primers 64 (SEQ ID NO:260) and 65 (SEQ ID 
NO:261). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Xhol and EcoRI, and purified using the QIAquick PGR purificafion kit 
(Qiagen, Valencia, CA). Plasmid pMON 26869 (derivative of pPIC9, Invitrogen, 
Carlsbad, CA) was digested with restriction endonucleases Xhol and EcoRI, and gel 
purified, resulting in an approximately 2900 base pair vector fragment. The purified 
restriction fragments were combined and ligated using T4 DNA ligase. 

The ligation reaction mixture was used to transfr)rm E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
presence of the correct insert. The resulting plasmid was designated pMON37408 
(containing SEQ ID NO:262, encoding protein sequence SEQ ID NO:263). 

Example 12: Production of permutein proteins in Pichia pastoris 

Plasmids pMON37402, pMON37405, pMON37406, pMON37407, and 
pMON37408 were individually used to electroporate KM71 cells from Pichia pastoris 
according to the procedure supplied by the manufacturer (Invitrogen, Carlsbad, CA). The 
resulting transformed cells were used to produce protein in Pichia pastoris following the 
procedure supplied by the manufacturer (Invitrogen, Carlsbad, CA). 

The concentration of patatin in the culture was determined using a patatin ELISA 
assay and the enzyme activity was measured using the method of Hofgen and Willmitzer 
{Plant Science, 66: 221-230, 1990). The variants containing multiple mutations were 
further purified using Mono Q and hydrophobic interaction chromatography (HIC). Each 
culture was purified by first sizing on YMIO membranes (Amicon, MA) to a [>10 kDa] 
fraction, followed by chromatography on the Mono Q HR 10/10 column (Pharmacia, NJ). 
For chromatography on the Mono Q colunrn, the samples were loaded on the colunm in 
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1 25 mM Tris pH 7.5 and eluted with a gradient of 1.0 M KCl in 25 mM Tris pH 7.5. 

2 Fractions containing patatin protein were determined using SDS-PAGE. For 

3 chromatography on the HIC column, the appropriate fractions were pooled and dialyzed 

4 into 1 M ammonium sulfate in 25 mM Tris pH 7.5. The dialyzed sample was then loaded 

5 on 16/10 phenyl Sepharose column (Pharmacia, NJ) and eluted with a gradient of 25 mM 

6 Tris pH7.5. 

7 The protein concentration was determined using the Bradford method, using BSA 

8 as a standard. SDS-PAGE analysis showed that these proteins were essentially pure. The 

9 esterase activity of the variants are shown in Table 7. 

10 Table 7: Activity of permuteins 



pMON 


Breakpoint 


Activity (AOD min'Vg"') 


Native en2yme 




83.21 


pMON37402 


246/247 


66.7 


pMON37405 


201/202 


No expression 


pMON37406 


183/184 


No expression 


pMON37407 


268/269 


12.1 


pMON37408 


321/322 


No expression 



11 

12 The activity was determined using /?-nitrophenyl caprate substrate as described by 

13 Hofgen and Willmitzer (Plant Science, 66: 22 1 -230, 1 990). 

14 Example 13: Insect bioefificacv assays 

15 Assays for activity against larvae of SCRW are carried out by overlaying the test 

16 sample on an agar diet similar to that described by Marrone (J. Econ. Entom. 78: 290- 

17 293, 1985). Test samples were prepared in 25 mM Tris, pH 7.5 buffer. Neonate larvae 

18 are allowed to feed on the treated diet at 26°C, and mortality and growth stunting were 

19 evaluated after 5 or 6 days. The results of this assay are shown in Table 8. 

20 Table 8: Insect bioefficacy assay 



1 Protein (200 ppm) 


Mean Survival Weight 


% Weight Reduction | 


1 Tris buffer (control) 


1.26 + 0.3 




[ Wild Type 


0.21 ± 0.02 


83 1 
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1 pMON37402 


0.21 ± 0.03 


83 1 


1 pMON37407 


0.32 ± 0.04 


75 



2 These data demonstrate that the growth of the SCRW larvae is similarly reduced 

3 upon ingestion of the proteins encoded by pMON37402 and pMON37407 as compared to 

4 the wild type patatin protein. 

5 Example 14: Permutein sequences improved for monocot expression 

6 Modification of coding sequences has been demonstrated above to improve 

7 expression of insecticidal proteins. A modified coding sequence was thus designed to 

8 improve expression in plants, especially com (SEQ ID NO:264). 

9 Example 15: Construction of pMON40701 for monocot expression 

10 Plasmid pMON 19767 was digested with restriction endonucleases Ncol and 

11 EcoRI and the 1 100 bp gene fragment was purified using the QIAquick PCR purification 

12 kit (Qiagen, Valencia, CA). Plasmid pMON33719 was digested with restriction 

13 endonucleases Ncol and EcoRI, and gel purified, resulting in an approximately 3900 base 

14 pair vector fragment. The two purified restriction fragments were combined and ligated 

15 using T4 DNA ligase. 

16 The ligation reaction mixture was used to transform E. coli strain DH5a cells 

17 (Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 

18 ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 

19 presence of the correct insert. The resulting plasmid was designated pMON40700. 

20 Plasmid pMON40700 was digested with restriction endonuclease NotI and the resulting 

21 2200 bp DNA fragment was purified using the QIAquick PCR purification kit (Qiagen, 

22 Valencia, CA). Plasmid pMON30460 was digested with restriction endonuclease NotI, 

23 and gel purified, resulting in an approximately 4200 base pair vector fragment. The two 

24 purified restriction fragments were combined and ligated using T4 DNA ligase. 

25 The ligation reaction mixture was used to transform E. coli strain DH5a cells 

26 (Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
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kanamycin-containing plates. The resulting plasmid was designated pMON40701 
(containing SEQ ID NO:264, encoding protein sequence SEQ ID NO:265). 

Example 16: Construction of pMON40703 for monocot expression 

The nucleic acid sequence encoding the permutein protein in plasmid 
pMON40703 was created using the method illustrated in Figure 3 and described in 
Example 6. Nucleic acid molecule "New N-termini fragment" was created and amplified 
from the sequence encoding patatin in plasmid pMON 19767 using oligonucleotide 
primers Synl (SEQ ID NO:266) and Syn2 (SEQ ID NO:267). Nucleic acid molecule 
"New C-termini fragment" was created and amplified from the sequence encoding patatin 
in plasmid pMON19767 using oligonucleotide primers Syn3 (SEQ ID NO:268) and Syn4 
(SEQ ID NO:269). The full-length nucleic acid molecule encoding the permutein protein 
was created and amplified from annealed fragments "New N-termini fragment" and 
"New C-termini fragment" using oligonucleotide primers Synl (SEQ ID NO:266) and 
Syn4(SEQ IDNO:269). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Ncol and EcoRI, and purified using the QIAquick PCR purification kit 
(Qiagen, Valencia, CA). Plasmid pMON33719 was digested with restriction 
endonucleases Ncol and EcoRI, and gel purified, resulting in an approximately 3900 base 
pair vector fragment. The purified restriction fragments were combined and ligated using 
T4 DNA ligase. 

The ligation reaction mixture was used to transform E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
presence of the correct insert. The resulting plasmid was designated pMON40702. 
Plasmid pMON40702 was digested with NotI, and the resulting 2200 bp DNA Segment 
was purified using the QIAquick PCR purification kit (Qiagen, Valencia, CA). Plasmid 
pMON30460 was digested with restriction endonuclease NotI, and gel purified, resulting 
in an approximately 4200 base pair vector fragment. The purified restriction fragments 
were combined and ligated using T4 DNA ligase. 
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The ligation reaction mixture was used to transform E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
kanamycin-containing plates. The resulting plasmid was designated pMON40703 
(containing SEQ ID NO:270, encoding protein sequence SEQ ID NO:27]). Plasmid 
pMON40703 encodes a permutein protein with a "breakpoint" at positions 246/247 of the 
wild type patatin protein sequence (SEQ ID N0:2). The first 23 amino acids of SEQ ID 
N0:2 are a signal peptide sequence which is cleaved in the mature protein. 

Example 17: Construction of pMON40705 for monocot expression 

The nucleic acid sequence encoding the permutein protein in plasmid 
pMON40705 was created using the method illustrated in Figure 3 and described in 
Example 6. Nucleic acid molecule "New N-termini fragment" was created and amplified 
from the sequence encoding patatin in plasmid pMON 19767 using oligonucleotide 
primers SynlO (SEQ ID NO:272) and Syn2 (SEQ ID NO:267). Nucleic acid molecule 
"New C-termini fragment" was created and amplified from the sequence encoding patatin 
in plasmid pMON 19767 using oligonucleotide primers Syn3 (SEQ ID NO:268) and 
Synl 1 (SEQ ID NO:273). The full-length nucleic acid molecule encoding the permutein 
protein was created and amplified from annealed fragments "New N-termini fragment" 
and "New C-termini fragment" using oligonucleotide primers SynlO (SEQ ID NO:272) 
and Synl 1 (SEQ ID NO:273). 

The resulting amplified nucleic acid molecule was digested with restriction 
endonucleases Ncol and EcoRI, and purified using the QIAquick PGR purification kit 
(Qiagen, Valencia, CA). Plasmid pMON33719 was digested with restriction 
endonucleases Ncol and EcoRI, and gel purified, resulting in an approximately 3900 base 
pair vector fragment. The purified restriction fragments were combined and li gated using 
T4 DNA ligase. 

The ligation reaction mixture was used to transform E. coli strain DH5a cells 
(Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on 
ampicillin-containing plates. Plasmid DNA was isolated and sequenced to confirm the 
presence of the correct insert. The resulting plasmid was designated pMON40704. 
Plasmid pMON40704 was digested with restriction endonuclease NotI, and the resulting 
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1 2200 bp DNA fragment was purified using the QIAquick PCR purification kit (Qiagen, 

2 Valencia, CA). Plasmid pMON30460 was digested with restriction endonuclease NotI, 

3 and gel purified, resulting in an approximately 4200 base pair vector Augment. The 

4 purified restriction fragments were combined and ligated using T4 DNA ligase. 

5 The ligation reaction mixture was used to transform E. coli strain DH5a cells 

6 (Life Technologies, Gaithersburg, MD). Transformant bacteria were selected on plates 

7 containing kanamycin. The resulting plasmid was designated pMON40705 (containing 
g SEQ ID NO:274, encoding protein sequence SEQ ID NO:275). Plasmid pMON40705 

9 encodes a permutein protein with a "breakpoint" at positions 268/269 of the wild type 

10 patatin protein sequence (SEQ ID N0:2). The first 23 amino acids of SEQ ID N0:2 are a 

1 1 signal peptide sequence which is cleaved in the mature protein. 

12 Example 1 8: Transient expression of protein in com leaf protoplasts 

13 Plasmids pMON40701 , pMON40703, and pMON40705 (all containing the native 

14 signal sequence for vacuolar targeting) were separately electroporated into com leaf 

15 protoplasts as described by Sheen {Plant Cell 3: 225-245, 1991). Protein was extracted 

16 with glass beads and the supematant was assayed for protein expression using ELISA for 

17 patatin and NPTII. Expression of protein by the transformed com protoplasts was 

18 confirmed by Western blot analysis. Expression results are shown in Table 9. 

19 Table 9: ELISA data 



Sample 


Patatin ELISA 


NPTII ELISA 


Normalized Expression 1 




(Hg/mL) 


(^g/mL) 


(Patatin ELISA/NPTII ELISA) 


pMON40701 


1.1 


0.6 


1.8 


pMON40703 


2.1 


0.3 


7.0 


pMON40705 


1.3 


0.6 


2^2 1 



20 The results indicate that the permutein encoded by plasmid pMON40703 

21 surprisingly shows approximately 4-fold higher expression compared to the wild type 

22 enzyme. 
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Example 19: Deglycosvlationof protein sequences 



This example provides evidence that glycosylation of can contribute to the 
allergenicity of a protein. Accordingly, rational substitution of amino acid residues likely 
to be the targets of glycosylation within a subject allergen protein may reduce or 
eliminate the allergenic properties of the protein without adversely affecting the 
enzymatic, insecticidal, antifungal or other functional properties of the protein. 

Glycosylation commonly occurs as either N-linked or 0-linked forms. N-linked 
glycosylation usually occurs at the motif Asn-Xaa-Ser/Thr, where Xaa is any amino acid 
except Pro (Kasturi, L. et al., Biochem J. 323: 415-519, 1997; Melquist, J.L. et al., 
Biochemistry 37: 6833-6837, 1998). 0-linked glycosylation occurs between the hydroxyl 
group of serine or threonine and an amino sugar. 

Site directed mutagenesis of selected asparagine, serine, or threonine may be used 
to reduce or eliminate the glycosylation of patatin proteins. A search of SEQ ID N0:2 
for the Asn-Xaa-Ser/Thr motif reveals one occurrence at amino acid positions 202-204. 
Mutagenization of the nucleic acid sequence encoding this region results in a reduced 
allergenicity of the encoded protein. 

In order to test this approach to reducing allergenicity of patatin proteins, two sets 
of experiments were performed: a) production of patatin proteins in Escherichia coli, 
which do not glycosylate proteins; and b) production of patatin proteins with an N202Q 
site directed mutation. 

Antibodies obtained from patients HS-07 and G15-M0N (not potato allergic) did 
not show specific binding to wild type patatin, patatin produced in E. coli, or the N202Q 
variant. Antibodies obtained from patient HS-01 (potato allergic) bound to wild type 
patatin, but not to patatin produced in E. coli or the N202Q variant. Antibodies obtained 
from patient HS-02 (potato allergic) bound strongly to wild type patatin, but extremely 
weakly to patatin produced in E. coli, and binding to the N202Q variant resembled vector 
controls. Antibodies obtained from patient HS-03 (potato allergic) bound to wild type 
patatin, but not to patatin produced in E. coli or the N202Q variant. Antibodies obtained 
from patient HS-05 (potato allergic) boimd to wild type patatin, but very weakly to 
patatin produced in E. coli and the N202Q variant. Antibodies obtained from patient HS- 
06 (potato allergic) strongly bound wild type patatin, the N202Q variant, and to patatin 
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produced in E. coli. These results strongly suggest that glycosylation is at least partially 
responsible for the antigenic properties of patatin proteins, and that site directed 
mutagenesis may be used to reduce or eliminate specific antibody binding. Mutagenesis 
at position 202 of SEQ ID N0:2 may be useful for reducing or eliminating specific 
antibody binding. 

The deglycosylation approach was also tested using a patatin homolog, Pat 17. As 
demonstrated above, patatin epitopes exhibiting IgE binding were identified, and each 
contained a Tyr residue. Substitution of these Tyr residues within each epitope led to loss 
of IgE binding. Site-directed mutagenesis was used to produce variants with individual 
and multiple Tyr substitutions in the protein, which was expressed in Pichia pastoris and 
assessed for enzyme activity. All the variants were found to have en2ymatic activity no 
less than the wild type protein. A single variant with all 5 tyrosine residues substituted 
with phenylalinine was found to have insecticidal activity no less than the unsubstituted 
protein and was expressed in E.coli to produce the non-glycosylated version. The E.coli 
5-"Tyr to Phe" variant was assessed for IgE binding. An isozyme of patatin, designated 
Patl7, was also expressed in com to produce a plant glycoprotein and in E.coli to 
produce a nonglycosylated protein. Sera of seven patients (five exhibiting potato allergy 
and one exhibiting other allergies but no allergy to potatoes) were were used to assay 
Pat 17 or Pat 17 variant binding by immunoblot assay. Four of the five sera from patients 
exhibiting potato allergy showed either very weak or no binding to wild type patatin 
expressed in E.coli but did bind to the 5 -Tyr variant. Serum firom one patient exhibiting 
potato allergy showed strong binding to recombinant wild type patatin protein expressed 
in E.coli but weak binding to the 5-Tyr variant. Sera from all five patients exhibiting 
potato allergy bound strongly to patatin expressed in com and native patatin present in 
potatoes. Serum fi-om a control patient allergic to eggs, milk, peanuts and seafood, but 
exhibiting no allergy to potatoes showed no binding to patatin expressed in E.coli but did 
bind to patatin expressed in com. Immunoblot results suggested that the sugar moiety in 
patatin is a non-specific IgE binding epitope and the polypeptide portion of patatin also 
contains immunogenic IgE epitopes. 
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Patients who suffer from potato allergy were identified at Johns Hopkins Clinic 
(Baltimore, MD) and were evaluated for potato allergy using clinical criteria outlined in 
Table 2. 

Serum was obtained from patients with convincing clinical history of potato 
allergy. The convincing history was defined as being one or more of the following: a) 
positive potato allergic reaction as evaluated by double-blind placebo-control food 
challenge b) anaphylaix and/or hospitalization due to the consumption of potatoes or c) 
dramatic skin test results. 
Peptide Synthesis 

Peptides were synthesized on cellulose membranes using the SPOTS system 
(Genosys Biotechnologies, TX). Membranes were stored at -20°C until use. 
Site directed Mutagenesis 

Site specific mutations were introduced into patatin by first incorporating part of 
the a-factor signal sequence (Pichia expression manual, Invitrogen, Carlsbad, CA) to the 
patatin gene using PCR. Primers used for the PCR were 

GGAGCTCGAGAAAAGAGAGGCTGAAGCTCAGTTGGGAGAAATGGTGACTGT 
TCT (Xhol site in italics) and GGTCTAGAG a4^7TCTCATTAATAAGAAG (EcoRl 
site in italics). The primers contained restriction sites to facilitate cloning into Pichia 
pastoris yeast secretion vector pPIC9 (GenBank accession number Z46233; Invitrogen, 
Carlsbad, CA). Typical PCR conditions are 25 cycles 94°C denaturation for 1 minute, 
45''C aimealing for one minute and 72°C extension for 2 minutes; plus one cycle 72°C 
extension for 10 minutes. A 50 mL reaction contained 30 pmol of each primer and 1 mg 
of template DNA; and 1 X PCR buffer with MgCb, 200 mM dGTP, 200 mM dATP, 200 
mM dTTP, 200 mM dCTP, 2.5 units of Pwo DNA polymerase. PCR reactions are 
performed in RoboCycler Gradient 96 Temperature Cycler (Stratagene, La Joila, CA). 

The amplified patatin gene was digested with restriction enzymes Xhol and EcoRl 
and cloned into the pBluescript vector (Stratagene, La Jolla, CA), digested with the same 
two restriction enzymes. The template plasmid DNA used for the PCR was 
pMON26820. The resulting plasmid (pMON 26869) was used for oligonucleotide- 
directed mutagenesis using the Bio-Rad mutagenesis kit based on the method of Kunkel 
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et al., Proc Natl Acad Sci USA 82, 477-92 (1985). Briefly, single-stranded pMON26869 
was used as template for mutagenesis and was prepared by superinfection of plasmid 
containing cells with M13K07 (Gorman et al., DNA and Protein Engineering techniques 
2, 3-10 (1990)). DNA purified from transformed DH5a E. coli colonies was used for 
sequence determination. Sequencing was performed using the ABI PRISM sequencing 
kit (Perkin Elmer Biosystems, Foster City, CA). 
Protein Expression in Pichia pastoris 

Plasmids containing the mutations in the patatin gene were digested with restriction 
enzymes Xhol and EcoRl. The patatin nucleic acid fragment was then ligated into the 
pPIC9 vector (Invitrogen, Carlsbad, CA), digested with the same two restriction enzymes 
to afford plasmid pMON37401. Pichia pastoris KM71 cells were electroporated with 
pMON37401 containing the appropriate mutation. The resulting transformed cells were 
used to produce protein in Pichia pastoris using the procedure supplied by the 
manufacturer (Invitrogen, Carlsbad, CA). The proteins were purified in the same way as 
the proteins expressed in E. coli (see below). 
Western Blotting of Proteins 

Protein samples were electrophoresed by SDS-PAGE and electroblotted onto PVDF 
membrane (Millipore, Bedford MA). Protein blots were processed by standard Western 
blotting (immimoblotting) techniques and were incubated in potato allergic serum diluted 
1 :5 in PBS buffer for 1 hour. After washing the blots 3 times with PBS, the blots were 
incubated in biotinylated anti-IgE (Johns Hopkins Hospital, Baltimore MD) for 1 hour, 
followed by a 30 minute incubation in HRP-linked avidin (Promega, New York, NY). 
IgE-reactive protein bands were visualized by using the ECL system (Amersham 
Pharmacia Biotech, NJ). As a control, one blot was incubated in anti-IgE only. His- 
tagged glyphosate oxidase and potato extracts was prepared and provided for this study 
by Regulatory Sciences, Monsanto Company. The peptides were evaluated using the 
same incubation procedures as described above. 
Expression and purification of patatin in com 

An isozyme of patatin, Pat 17, was generated for expression in com using a modified 
plant optimized gene sequence as described by Brown et al (US Patent 5,689,052). All 
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the constructs contained the native 23 amino acid signal peptide for vacuolar targeting. 
Com was transformed by microprojectile bombardment (Morrish et al., in Transgenic 
plants. Fundamentals and Applications (ed. Hiatt, A.) 133-171 (Marcel Dekker, New 
York, 1993); Songstad et al., In Vitro Cell Dev Biol - Plant 32, 179-183 (1996)). 
Protein from the transformed com plants was purified by first grinding the leaves in 
liquid nitrogen and extracting the protein usmg 25 mM Tris/HCl. The plant extract was 
further dialyzed against 25 mM Tris/HCI pH 7.5. The plant extract was then loaded onto 
Mono Q HR 10/10 anion-exchange column (Amersham Pharmacia, NJ) equilibrated with 
25 mM Tris/HCl pH 7.5 (buffer A). The protein was eluted with 25 mM Tris/HCl pH 
7.5, 1 M KCl (buffer B) using a linear gradient of 0-100% buffer B using an HPLC 
system (Shimadzu). Fractions containing protein were assayed for esterase activity and 
dialyzed against 25 mM Tris/HCl pH 7.5, 1 M Ammonium Sulfate (buffer C). The 
protein was purified to homogeneity by loading onto a phenyl-Sepharose 16/10 column 
(Amersham Pharmacia, NJ) equilibrated with buffer C. Esterase active fractions were 
pooled and dialyzed against 25 mM Tris/HCI pH 7.5. 
Expression and purification of patatin in E.coli 

Patl7 was expressed in E.coli using the pET expression system (Novagen, Wl). The 
coding region of the mature Pat 17 gene (without its signal peptide) was amplified by 
PCR using the primers 5'-GGGCC4rGGCGCAGTTGGGAGAAATGGTG-3' {Ncol site 
in italics) and 5'-AACAAAGCTTCTTATTGAGGTGCGGCCGCTTGCATGC-3' (Notl 
site in italics) using standard PCR reaction conditions as described in the Gene Amp kit 
(Perkin-Elmer Cetus, CT) and an annealing temperature of 40°C. The template was 
plasmid pMON26820. The resulting DNA was digested with Ncol and Notl and cloned 
into a modified pET24d plasmid, designed to add an N-terminal hexa-histidine tag to the 
protein. The correct sequence of the PCR product was verified by sequencing, and the 
plasmid was transformed into E.coli BL21 (DE3), and transformants selected on LB 
containing 25 mg/mL kanamycin. The expression strain was grown in LB containing 25 
mg/mL kanamycin and induced for 8 hrs at 28°C with 1 mM IPTG. Cells were harvested 
and washed in 50 mM Tris/HCl pH 8.5, 150 mM NaCl, and lysed by French Press at 
20,000 psi. His-tagged protein was recovered in the soluble fraction of lysed cells and 
subsequently purified using Ni-NTA resin as described in the QIAexpressionist manual 
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(Qiagen CA). The partially purified protein was then dialyzed against 25 mM Tris/HCl 
pH 7.5 (buffer A) and loaded onto Mono Q HR 10/10 anion-exchange column 
(Amersham Pharmacia, NJ) equilibrated with buffer A. The protein was eluted with 25 
mM Tris/HCl pH 7.5, 1 M KCl (buffer B) using a linear gradient of 0-100% buffer B run 
over 30 min at a flow rate of 4 mL/min using an HPLC system (Shimadzu). Fractions 
containing protein were assayed for esterase activity. Esterase active fractions were 
pooled, concentrated and dialyzed against 25 mM Tris/HCl pH 7.5 and stored at 4°C. 
Enzyme Activity Assays 

Enzyme activity was measured as described previously using /7-nitrophenyl caprate 
(Sigma, MO) as a substrate, dissolved in dimethylsulfoxide (5 mM stock solution) and 
diluted in 4% Triton X-100, 1% SDS to a final concentration of 1 mM. For the assay, 20 
mL of protein solution was added to a mixture of 25 mL of the 1 mM substrate solution 
and 80 mL of 50 mM Tris pH 8.5. The enzyme activity was monitored at 405 nm in 6 
sec interval for a period of 10 min. Esterase activity was expressed as DOD min'mg'' 
protein. 

Insect Bioassay 

The protein was also assayed for activity against larvae of Diabrotica virigifera (Western 
com rootworm) by overiaying the test sample on an agar diet similar to that described 
previously (Marrone et al., J. Econ. Entom. 78, 290-3 (1985)). Proteins to be tested were 
diluted in 25 mM Tris/HCl pH 7.5 and overlayed on the diet surface. Neonate larvae 
were allowed to feed on the diet and mortality and growth stunting were evaluated after 6 
days. 

IgE Binding Epitopes on Patatin 

A panel of eighty-nine overlapping peptides representing the amino acid sequence of 
patatin were synthesized to determine the regions responsible for IgE binding. Each 
peptide was 10 amino acids long and consisted of 6 amino acid overlap between the 
consecutive peptides. The peptides were evaluated for IgE binding with five different 
potato allergic patient sera. Patatin has 3 major epitopes. These major IgE binding 
regions represent amino acids 184-193, 188-197, 269-278 and 360-369. Other minor IgE 
binding regions represent amino acids 104-113, 138-147 and 316-325. The amino acids 
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essential for IgE binding in each major and minor epitopes were determined by 
synthesizing peptides with single amino acid changes at each position by individually 
substituting an alanine residue at each non-alanine position in the epitopes. The resulting 
alanine substituted peptides were evaluated for IgE binding. Result effective 
substitutions were identified by a reduction in IgE binding with respect to the non- 
substituted peptide sequence. It was very interesting to note that all the epitopes 
contained a Tyr residue and substitution of this Tyr for Ala or Phe eliminated IgE 
binding. 

Enzyme and Bioactivity 

The Tyr residues identified to be critical for IgE binding in each of the epitopes were 
substituted with Phe either individually or in concert using site-directed mutagenesis. All 
the variants were expressed in Pichia pastoris and assessed for enzyme activity and 
insecticidal activity. The variants included Y106F, Y129F, Y185F, Y193F, Y270F, 
Y316F, Y362F, Y185FA'193F, Y185FA^193FA^270F/Y316FA^362F (5-Tyr) and 
Y106F/Y129FA^185FA^193FA^270FA'316F/ Y362F (7-Tyr). All the variants 
maintained enzyme activity. The 5-Tyr and 7-Tyr variants were then assessed for 
insecticidal activity by overlaying protein (200 ppm final concentration). The proteins 
caused significant stunting of the larval growth as measured by the weight of the larvae 
after 6 days with the 5-Tyr variant showing higher insecticidal activity compared to the 7- 
Tyr and wild type proteins. The 7-Tyr variant was unstable upon long term storage at 
4°C and thus was not pursued fiirther. 
Immimoblotting 

In order to test if the glycan moiety on patatin was important for binding of IgE, Pat 17 
was expressed in E.coli to produce a nonglycosylated protein and in com to produce a 
plant glycosylated protein. The 5-Tyr variant was also expressed in E.coli to assess the 
individual contribution of the linear epitopes without the glycan moiety on the protein. 
.The proteins were tested for binding to IgE using sera firom five patients with allergy to 
potatoes and sera fi-om one patient with allergies to many things but no allergy to 
potatoes. Proteins from both com and E.coli were purified to homogeneity. These 
proteins were transferred to PVDF membrane (Millipore, MA) and subsequently probed 
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with sera from patients with and without allergy to potatoes. A His-tagged glyphosate 
oxidase control was included in all the studies to verify that the His-tag did not affect the 
binding of IgE. Serum obtained from patient HS-07 (no allergy to potatoes) did not bind 
Patl7 expressed in E.coli but showed good binding to Pat 17 from com and also a protein 
at the same molecular weight in potato extract. It is interesting to note that this sera also 
showed strong binding to another protein (> 46kDa) in the potato. Sera from patients 
HS-01, HS-02, HS-03, HS-05 (allergy to potatoes) shows strong binding to Patl7 
expressed in com, but very weak to no binding to Patl7 produced in E.coli. Also, the 
sera from patients HS-01, HS-2, HS-03 and HS-05 bound to a protein of similar 
molecular weight in the potato extract. Sera from patients HS-01, HS-02 and HS-03 also 
showed binding to another protein in potato extract of a lower molecular weight (< 
30kDa). Serum obtained from patient HS-06 (allergic to potatoes) showed very strong 
binding to wild type patatin expressed in both com and E.coli but weaker binding to the 
5-Tyr variant expressed in E.coli. Sera from HS-06 also showed very strong binding to a 
protein in potato extract with similar molecular weight as patatin. The sera from all the 
patients showed no binding to His-tagged glyphosate oxidase indicating that the His-tag 
does not bind IgE. These results strongly suggest that the glycan moiety on Pat 17 is 
responsible for IgE binding in some potato allergic patients and linear epitopes also 
contribute to the antigenicity of patatin. 

Example 20: Alternative nucleic acid and protein sequences 

For fiiture variations of the patatin protein, sequences showing high similarity to 
the sequences disclosed herein could be used in producing deallergenized patatin proteins 
and permuteins. For example, a BLAST search (Altschul, S.F. et al., J. Mol. Biol. 215: 
403-410, 1990) can be performed to identify additional patatin sequences. Sources other 
than those disclosed herein can be used to obtain a patatin nucleic acid sequence, and the 
encoded patatin protein. Furthermore, subunit sequences from different organisms can be 
combined to create a novel patatin sequence incorporating structural, regulatory, and 
enzymatic properties from different sources. 
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Example 2 1 : Nucleic acid mutation and hybridization 



Variations in the nucleic acid sequence encoding a patatin protein may lead to 
mutant patatin protein sequences that display equivalent or superior enzymatic 
characteristics when compared to the sequences disclosed herein. This invention 
accordingly encompasses nucleic acid sequences which are similar to the sequences 
disclosed herein, protein sequences which are similar to the sequences disclosed herein, 
and the nucleic acid sequences that encode them. Mutations can include deletions, 
insertions, truncations, substitutions, fusions, shuffling of subunit sequences, and the like. 

Mutations to a nucleic acid sequence can be introduced in either a specific or 
random manner, both of which are well known to those of skill in the art of molecular 
biology. A myriad of site-directed mutagenesis techniques exist, typically using 
oligonucleotides to introduce mutations at specific locations in a nucleic acid sequence. 
Examples include single strand rescue (Kunkel, T. Proc. Natl. Acad. Sci. U.S.A., 82: 488- 
492, 1985), unique site elimination (Deng and Nickloff, Anal. Biochem. 200: 81, 1992), 
nick protection (Vandeyar, et al. Gene 65: 129-133, 1988), and PGR (Costa, et al. 
Methods Mol. Biol. 57: 31-44, 1996). Random or non-specific mutations can be 
generated by chemical agents (for a general review, see Singer and Kusmierek, Ann. Rev. 
Biochem. 52: 655-693, 1982) such as nitrosoguanidine (Cerda-Olmedo et al., J. Mol. 
Biol. 33: 705-719, 1968; Guerola, et al. Nature New Biol. 230: 122-125, 1971) and 2- 
aminopurine (Rogan and Bessman, /. Bacterial. 103: 622-633, 1970), or by biological 
methods such as passage through mutator strains (Greener et al. Mol. Biotechnol. 7: 189- 
195, 1997). 

Nucleic acid hybridization is a technique well known to those of skill in the art of 
DNA manipulation. The hybridization properties of a given pair of nucleic acids is an 
indication of their similarity or identity. Mutated nucleic acid sequences can be selected 
for their similarity to the disclosed patatin nucleic acid sequences on the basis of their 
hybridization to the disclosed sequences. Low stringency conditions can be used to select 
sequences with multiple mutations. One may wish to employ conditions such as about 
0.15 M to about 0.9 M sodium chloride, at temperatures ranging from about 20°C to 
about 55°C. High stringency conditions can be used to select for nucleic acid sequences 
with higher degrees of identity to the disclosed sequences. Conditions employed may 
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include about 0.02 M to about 0.15 M sodium chloride, about 0.5% to about 5% casein, 
about 0.02% SDS and/or about 0.1% N-laurylsarcosine, about 0.001 M to about 0.03 M 
sodium citrate, at temperatures between about 50°C and about 70°C. More preferably, 
high stringency conditions are 0.02 M sodium chloride, 0.5% casein, 0.02% SDS, 0.001 
M sodium citrate, at a temperature of 50°C. 

Example 22: Determination of homologous and degenerate nucleic acid s equences 

Modification and changes can be made in the sequence of the proteins of the 
present invention and the nucleic acid segments which encode them and still obtain a 
functional molecule that encodes a protein with desirable properties. The following is a 
discussion based upon changing the amino acid sequence of a protein to create an 
equivalent, or possibly an improved, second-generation molecule. The amino acid 
changes can be achieved by changing the codons of the nucleic acid sequence, according 
to the codons given in Table 10. 



Table 10: Codon degeneracies of amino acids 



Amino acid 


One letter 


\ Three letter 


Cddpiisj'^^I^^Lj/^^. 


Alanine 


A 


Ala 


GCA GCC GCG GCT 


Cysteine 


C 


Cys 


TGC TGT 


Aspartic acid 


D 


Asp 


GAC GAT 


Glutamic acid 


E 


Glu 


GAA GAG 


Phenylalanine 


F 


Phe 


TTC TTT 


Glycine 


G 


Gly 


GGA GGC GGG GGT 


Histidine 


H 


His 


CAC CAT 


Isoleucine 


1 


He 


ATA ATC ATT 


Lysine 


K 


Lys 


AAA AAG 


Leucine 


L 


Leu 


TTA TTG CTA CTC CTG CTT 


Methionine 


M 


Met 


ATG 


Asparagine 


N 


Asn 


AAC AAT 


Proline 


P 


Pro 


CCA CCC CCG CCT 


Glutamine 


Q 


Gin 


CAA CAG 


Arginine 


R 


Arg 


AGA AGG CGA CGC CGG CGT 


Serine 


S 


Ser 


AGC AGT TCA TCC TCG TCT 


Threonine 


T 


Thr 


ACA ACC ACG ACT 


Valine 


V 


Val 


GTA GTC GTGGTT 


1 Tryptophan 


W 


Trp 


TGG 


1 Tyrosine 


Y 


Tyr 


TAC TAT 1 
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Certain amino acids can be substituted for other amino acids in a protein sequence 
without appreciable loss of enzymatic activity. It is thus contemplated that various 
changes can be made in the peptide sequences of the disclosed protein sequences, or their 
corresponding nucleic acid sequences without appreciable loss of the biological activity. 

In making such changes, the hydropathic index of amino acids can be considered. 
The importance of the hydropathic amino acid index in conferring interactive biological 
function on a protein is generally understood in the art (Kyte and Doolittle, J. Mol. Biol., 
157: 105-132, 1982). It is accepted that the relative hydropathic character of the amino 
acid contributes to the secondary structure of the resultant protein, which in turn defines 
the interaction of the protein with other molecules, for example, enzymes, substrates, 
receptors, DNA, antibodies, antigens, and the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics. These are: isoleucine (+4.5); valine (+4.2); 
leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine 
(+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); 
proline (-1.6); histidine (-3.2); glutamate/glutamine/aspartate/asparagine (-3.5); lysine (- 
3.9); and arginine (-4.5). 

It is known in the art that certain amino acids can be substituted by other amino 
acids having a similar hydropathic index or score and still result in a protein with similar 
biological activity, i.e., still obtain a biologically functional protein. In making such 
changes, the substitution of amino acids whose hydropathic indices are within ±2 is 
preferred, those within ±1 are more preferred, and those within ±0.5 are most preferred. 

It is also understood in the art that the substitution of like amino acids can be 
made effectively on the basis of hydrophilicity. U.S. Patent No. 4,554,101 (Hopp, T.P., 
issued November 19, 1985) states that the greatest local average hydrophilicity of a 
protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a 
biological property of the protein. The following hydrophilicity values have been 
assigned to amino acids: arginine/lysine (+3.0); aspartate/glutamate (+3.0 ±1); serine 
(+0.3); asparagine/glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ±1); 
alanine/histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine/isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); and tryptophan (-3.4). 
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It is understood that an amino acid can be substituted by another amino acid 
having a similar hydrophilicity score and still result in a protein with similar biological 
activity, i.e., still obtain a biologically functional protein. In making such changes, the 
substitution of amino acids whose hydropathic indices are within ±2 is preferred, those 
within ±1 are more preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substitutions are therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of 
the foregoing characteristics into consideration are well known to those of skill in the art 
and include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine 
and asparagine; and valine, leucine, and isoleucine. Changes which are not expected to 
be advantageous may also be used if these resulted in functional patatin proteins. 
Example 23: Production of patatin proteins and permuteins in plants 

Plant Vectors 

In plants, transformation vectors capable of introducing nucleic acid sequences 
encoding patatin proteins and permuteins are easily designed, and generally contain one 
or more nucleic acid coding sequences of interest under the transcriptional control of 5' 
and 3' regulatory sequences. Such vectors generally comprise, operatively linked in 
sequence in the 5' to 3' direction, a promoter sequence that directs the transcription of a 
downstream heterologous structural nucleic acid sequence in a plant; optionally, a 5' non- 
translated leader sequence; a nucleic acid sequence that encodes a protein of interest; and 
a 3' non-translated region that encodes a polyadenylation signal which functions in plant 
cells to cause the termination of transcription and the addition of polyadenylate 
nucleotides to the 3' end of the mRNA encoding the protein. Plant transformation 
vectors also generally contain a selectable marker. Typical 5 '-3' regulatory sequences 
include a transcription initiation start site, a ribosome binding site, an RNA processing 
signal, a transcription termination site, and/or a polyadenylation signal. Vectors for plant 
transformation have been reviewed in Rodriguez et al. (Vectors: A Survey of Molecular 
Cloning Vectors and Their Uses, Butterworths, Boston., 1988), Click et al. (Methods in 
Plant Molecular Biology and Biotechnology, CRC Press, Boca Raton, Fla., 1993), and 
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1 Croy (Plant Molecular Biology Labfax, Hames and Rickwood (Eds.), BIOS Scientific 

2 Publishers Limited, Oxford, UK., 1993). 

3 Plant Promoters 

4 Plant promoter sequences can be constitutive or inducible, environmentally- or 

5 developmentally-regulated, or cell- or tissue-specific. Often-used constitutive promoters 

6 include the CaMV 35S promoter (Odell, J.T. et al.. Nature 313: 810-812, 1985), the 

7 enhanced CaMV 35S promoter, the Figwort Mosaic Virus (FMV) promoter (Richins et 

8 al.. Nucleic Acids Res. 20: 8451-8466, 1987), the mannopine synthase (mas) promoter, 

9 the nopaline synthase (nos) promoter, and the octopine synthase (pes) promoter. Useful 

10 inducible promoters include promoters induced by salicylic acid or polyacrylic acids (PR- 

11 1, Williams , S. W. et al. Biotechnology 10: 540-543, 1992), induced by application of 

12 safeners (substituted benzenesulfonamide herbicides, Hershey, H.P. and Stoner, T.D., 

13 Plant Mol. Biol. 17: 679-690, 1991), heat-shock promoters (Ou-Lee et al., Proc. Natl. 

14 Acad Sci. U.S.A. 83: 6815-6819, 1986; Ainley et al.. Plant Mol. Biol. 14: 949-967, 

15 1990), a nitrate-inducible promoter derived from the spinach nitrite reductase gene (Back 

16 et al.. Plant Mol. Biol. 17: 9-18, 1991), hormone-indue ible promoters (Yamaguchi- 

17 Shinozaki, K. et al., Plant Mol. Biol. 15: 905-912, 1990; Kares et al., Plant Mol. Biol. 15: 

18 225-236, 1990), and light-inducible promoters associated with the small subunit of RuBP 

19 carboxylase and LHCP gene families (Kuhleraeier et al.. Plant Cell 1: 471, 1989; 

20 Feinbaum, R.L. et al., Mol. Gen. Genet. 226: 449-456, 1991; Weisshaar, B. et al., EMBO 

21 J. 10: 1777-1786, 1991; Lam, E. and Chua, N.H., J. Biol. Chem. 266: 17131-17135, 

22 1990; Castresana, C. et al., EMBO J. 7: 1929-1936, 1988; Schulze-Lefert et al., EMBO J. 

23 8: 651, 1989). Examples of useful tissue-specific, developmentally-regulated promoters 

24 include the p-conglycinin 7S promoter (Doyle, J.J. et al., J. Biol. Chem. 261: 9228-9238, 

25 1986; Slighton and Beachy, Planta 172: 356-363, 1987), and seed-specific promoters 

26 (Knutzon, D.S. et al., Proc. Natl. Acad Sci. U.S.A. 89: 2624-2628, 1992; Bustos, M.M. et 

27 al., EMBO J. 10: 1469-1479, 1991; Lam and Chua, Science 248: 471, 1991; Stayton et 

28 al., Aust. J. Plant. Physiol. 18: 507, 1991). Plant functional promoters useful for 

29 preferential expression in seed plastids include those from plant storage protein genes and 

30 from genes involved in fatty acid biosynthesis in oilseeds. Examples of such promoters 
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1 include the 5' regulatory regions from such genes as napin (Kridl et al., Seed Sci. Res. 1 : 

2 209-219, 1991), phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP 

3 desaturase, and oleosin. Seed-specific gene regulation is discussed in EP 0 255 378. 

4 Promoter hybrids can also be constructed to enhance transcriptional activity (Comai, L. 

5 and Moran, P.M., U.S. Patent No. 5,106,739, issued April 21, 1992), or to combine 

6 desired transcriptional activity and tissue specificity. 

7 Plant transformation and regeneration 

8 A variety of different methods can be employed to introduce such vectors into 

9 plant protoplasts, cells, callus tissue, leaf discs, meristems, etcetera, to generate 

10 transgenic plants, including Agrobacterium-mc^\2A&d transformation, particle gun 

11 delivery, microinjection, electroporation, polyethylene glycol mediated protoplast 

12 transformation, liposome-mediated transformation, etcetera (reviewed in Potrykus, 1. 

13 Ann. Rev. Plant Physiol. Plant Mol. Biol. 42: 205-225, 1991). In general, transgenic 

14 plants comprising cells containing and expressing DNAs encoding patatin proteins and 

15 permuteins can be produced by transforming plant cells with a DNA construct as 

16 described above via any of the foregoing methods; selecting plant cells that have been 

17 transformed on a selective medium; regenerating plant cells that have been transformed 
ig to produce differentiated plants; and selecting a transformed plant which expresses the 

19 protein-encoding nucleotide sequence. 

20 Sj)ecific methods for transforming a wide variety of dicots and obtaining 

21 transgenic plants are well documented in the literature (Gasser and Fraley, Science 244: 

22 1293-1299, 1989; Fisk and Dandekar, Scientia Horticulturae 55: 5-36, 1993; Christou, 

23 Agra Food Industry Hi Tech, p. 1 7, 1 994; and the references cited therein). 

24 Successful transformation and plant regeneration have been reported in the 

25 monocots as follows: asparagus {Asparagus officinalis; Bytebier et al., Proc. Natl. Acad. 

26 Sci. U.S. A. 84: 5345-5349, 1987); barley {Hordeum vulgarae; Wan and Lemaux, Plant 

27 PhysioL 104: 37-48, 1994); maize {Zea mays; Rhodes, C.A. et al.. Science 240: 204-207, 

28 1988; Gordon-Kamm et al.. Plant Cell 2: 603-618, 1990; Fromm, M.E. et al., 

29 Bio/Technology 8: 833-839, 1990; Koziel et al., Bio/Technology 1 1 : 194-200, 1993); oats 

30 {Avena sativa; Somers et al., Bio/Technology 10: 1589-1594, 1992); orchardgrass 
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1 (Dactylis glomerata; Horn et al., Plant Cell Rep. 7: 469-472, 1988); rice {Oryza sativa, 

2 including indica and japonica varieties; Toriyama et al., Bio/Technology 6: 10, 1988; 

3 Zhang et ai.. Plant Cell Rep. 7: 379-384, 1988; Luo and Wu, Plant Mol. Biol. Rep. 6: 

4 165-174, 1988; Zhang and Wu, Theor. Appl. Genet. 76: 835-840, 1988; Christou et al., 

5 Bio/Technology 9: 957-962, 1991); rye (Secale cereale; De la Pena et al., Nature 325: 

6 274-276, 1987); sorghum (Sorghum bicolor; Casas, A.M. et al., Proc. Natl Acad. Sci. 

7 U.S.A. 90: 11212-1 1216, 1993); sugar cane {Saccharum spp.; Bower and Birch, Plant J. 

8 2: 409-416, 1992); tall fescue {Festuca arundinacea; Wang, Z.Y. et al., Bio/Technology 

9 10: 691-696, 1992); turfgrass (Agrostis palustris; Zhong et al., Plant Cell Rep. 13: 1-6, 

10 1993); wheat {Triticum aestivum; Vasil et al., Bio/Technology 10: 661 -61 A, 1992; Weeks, 

11 T. et al.. Plant Physiol. 102: 1077-1084, 1993; Becker et al., Plant J. 5: 299-307, 1994), 

12 and alfalfa (Masoud, S.A. et al., Transgen. Res. 5: 313, 1996); Brassica (canola/oilseed 

13 rape) (Fry, J. Plant Cell Rep. 6: 321-325, 1987); and soybean (Hinchee, M. Bio/Technol. 

14 6:915-922, 1988). 

15 All of the compositions and/or methods disclosed and claimed herein can be made 

16 and executed without undue experimentation in light of the present disclosure. While the 

17 compositions and methods of this invention have been described in terms of preferred 

18 embodiments, it will be apparent to those of skill in the art that variations can be applied 

19 to the compositions and/or methods and in the steps or in the sequence of steps of the 

20 methods described herein without departing from the concept, spirit and scope of the 

21 invention. More specifically, it will be apparent that certain agents which are both 

22 chemically and physiologically related can be substituted for the agents described herein 

23 while the same or similar results would be achieved. All such similar substitutes and 

24 modifications apparent to those skilled in the art are deemed to be within the spirit, scope 

25 and concept of the invention. 

26 
27 
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SEQUENCE LISTING 



<110> ALIBHAI, MURTAZA F. 
ASTWOOD, JAMES D. 
SAMPSON, HUGH A. 
McWHERTER, CHARLES F. 



<120> PREPARATION OF 


DEALLERGENIZED PATATIN 


1 PROTEINS AND PERMUTEINS 




<130> MOBT217 












<160> 293 














<170> Patentin version 3.0 










<210> 1 
<211> 1158 
<212> DNA 

<213> Solanum Tuberosum 










<400> 1 
atggcaacta 


ctaaatcttt 


tttaatttta 


atatttatga 


tattagcaac 


tactagttca 


60 


acatttgctc 


agttgggaga 


aatggtgact 


gttcttagta 


t tgatggagg 


tggaattaga 


120 


gggatcattc 


cggctaccat 


tctcgaattt 


cttgaaggac 


aacttcagga 




180 


aatgcagatg 


caagacttgc 


agattacttt 


gatgtaattg 


gaggaacaag 


tacaggaggt 


240 


ttattgactg 


ctatgataag 


tactccaaat 


gaaaacaatc 


gaccctttgc 




300 


gaaattgtac 


ctttttactt 


cgaacatggc 


cctcagattt 






360 


ttaggcccaa 


aatatgatgg 


aaaatatctt 


atgcaagttc 


ttcaagaaaa 




420 


actcgtgtgc 


atcaagcttt 


gacagaagtt 


gtcatctcaa 






480 


aagccagtaa 


tattcactaa 


gtcaaattta 








540 


tatgacataa 


gttattccac 


agcagcagct 






ttactttgtt 


600 


actaatacta 


gtaatggaga 


tgaatatgag 


ttcaatcttg 


ttgatggtgc 


tgttgctact 


660 


gttgctgatc 


cggcgttatt 


atccattagc 


gttgcaacga 


gacttgcaca 


aaaggatcca 


720 


gcatttgctt 


caattaggtc 


attgaattac 


aaaaaaatgc 


tgttgctctc 


attaggcact 


780 


ggcactactt 


cagagtttga 


taaaacatat 


acagcaaaag 


aggcagctac 


ctggactgct 


840 


gtacattgga 


tgttagttat 


acagaaaatg 


actgatgcag 


caagttctta 


catgactgat 


900 


tattaccttt 


ctactgcttt 


tcaagctctt 


gattcaaaaa 


acaattacct 


cagggttcaa 


960 


gaaaatgcat 


taacaggcac 


aactactgaa 


atggatgatg 


cttctgaggc 


taatatggaa 


1020 


ttattagtac 


aagttggtga 


aaacttattg 


aagaaaccag 


tttccgaaga 


caatcctgaa 


1080 
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2 acctatgagg aagctctaaa gaggtttgca aaattgctct ctgataggaa gaaactccga 1140 
3 

4 gcaaacaaag cttcttat 1158 

5 

6 <210> 2 

7 <211> 386 

8 <212> PRT 

9 <213> Solanum Tuberosum 
10 

11 <400> 2 
12 

13 Met Ala Thr Thr Lys Ser Phe Leu lie Leu lie Phe Met lie Leu Ala 

14 1 5 10 15 
15 

16 Thr Thr Ser Ser Thr Phe Ala Gin Leu Gly Glu Met Val Thr Val Leu 

17 20 25 30 
18 

19 Ser He Asp Gly Gly Gly He Arg Gly He He Pro Ala Thr He Leu 

20 35 40 45 
21 

22 Glu Phe Leu Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala Asp Ala 

23 50 55 60 
24 

25 Arg Leu Ala Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly 

26 65 70 75 80 
27 

28 Leu Leu Thr Ala Met He Ser Thr Pro Asn Glu Asn Asn Arg Pro Phe 

29 8 5 9 0 9 5 
30 

31 Ala Ala Ala Lys Glu He Val Pro Phe Tyr Phe Glu His Gly Pro Gin 

32 100 105 110 
33 

34 He Phe Asn Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp Gly Lys 

35 115 120 125 
36 

37 Tyr Leu Met Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His 

38 130 135 140 
39 

40 Gin Ala Leu Thr Glu Val Val He Ser Ser Phe Asp He Lys Thr Asn 

41 145 150 155 160 
42 

43 Lys Pro Val He Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro Glu Leu 

44 1 65 1 70 1 75 
45 

46 Asp Ala Lys Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala Pro Thr 

47 180 185 190 
48 

49 Tyr Phe Pro Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly Asp Glu 

50 195 200 205 
51 

52 Tyr Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala Asp Pro 

53 2 1 0 2 1 5 2 2 0 
54 

55 Ala Leu Leu Ser He Ser Val Ala Thr Arg Leu Ala Gin Lys Asp Pro 

56 225 230 235 240 
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Ala Phe Ala Ser He Arg Ser Leu Asn Tyr Lys Lys Met Leu Leu Leu 
245 250 255 

Ser Leu Gly Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr Thr Ala 
260 265 270 

Lys Glu Ala Ala Thr Trp Thr Ala Val His Trp Met Leu Val He Gin 
275 280 285 

Lys Met Thr Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser 
290 295 300 

Thr Ala Phe Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin 
305 310 315 320 

Glu Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu 
325 330 335 

Ala Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu Lys Lys 
340 345 350 

Pro Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg 
355 360 365 

Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala 
370 375 380 

Ser Tyr 
385 

<210> 3 
<211> 54 
<212> DNA 
<213> Synthetic 

<400> 3 

ggagctcgag aaaagagagg ctgaagctca gttgggagaa atggtgactg ttct 54 

<210> 4 

<211> 29 

<212> DNA 

<213> Synthetic 

<400> 4 

ggtctagagg aattctcatt aataagaag 29 

<210> 5 

<211> 1138 

<212> DNA 

<213> Synthetic 

<400> 5 

ggagctcgag aaaagagagg ctgaagctca gttgggagaa atggtgactg ttcttagtat 60 

tgatggaggt ggaattagag ggatcattcc ggctaccatt ctcgaatttc ttgaaggaca 120 

acttcaggaa atggacaata atgcagatgc aagacttgca gattactttg atgtaattgg 180 
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acaggaggtt 


tattgactgc 


tatgataagt 


actccaaatg 


aaaacaatcg 


240 




gctgccaaag 


aaattgtacc 


tttttacttc 


gaacatggcc 


ctcagatttt 


300 




ggtcaaattt 


taggcccaaa 


atatgatgga 


aaatatctta 


tgcaagttct 


360 


tea 3*3^3 sa a 3 


cttggagaaa 


ctcgtgtgca 


tcaagctttg 


acagaagttg 


tcatctcaag 


420 






agccagtaat 


attcactaag 


tcaaatttag 


caaactctcc 


480 




gctaagatgt 


atgacataag 


ttattccaca 


gcagcagctc 


caacatattt 


540 




tactttgtta 


ctaatactag 


taatggagat 


gaatatgagt 


tcaatcttgt 


600 






ttgctgatcc 


ggcgttatta 


tccattagcg 


ttgcaacgag 


660 










ttgaattaca 


aaaaaatgct 


720 








agagtttgat 


aaaacatata 


cagcaaaaga 


780 






tacattggat 


gttagttata 


cagaaaatga 


ctgatgcagc 


840 


aagttcttac 


atgactgatt 


attacctttc 


tactgctttt 


caagctcttg 


attcaaaaaa 


900 


caattacctc 


agggttcaag 


aaaatgcatt 


aacaggcaca 


actactgaaa 


tggatgatgc 


960 


ttctgaggct 


aatatggaat 


tattagtaca 


agttggtgaa 


aacttattga 


agaaaccagt 


1020 


ttccgaagac 


aatcctgaaa 


cctatgagga 


agctctaaag 


aggtttgcaa 


aattgctctc 


1080 


tgataggaag 


aaactccgag 


caaacaaagc 


ttcttattaa 


tgagaattcc 


tctagacc 


1138 



<210> 6 

<211> 452 

<212> PRT 

<213> Synthetic 

<400> 6 

Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
15 10 15 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 

lie Pro Ala Glu Ala Val lie Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe He Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Glu Lys Arg Glu Ala Glu Ala Gin Leu Gly Glu Met Val Thr 
85 90 95 
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2 Val Leu Ser He Asp Gly Gly Gly He Arg Gly He He Pro Ala Thr 

3 100 105 110 
4 

5 He Leu Glu Phe Leu Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala 

6 115 120 125 
7 

8 Asp Ala Arg Leu Ala Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr 

9 130 135 140 
10 

11 Gly Gly Leu Leu Thr Ala Met He Ser Thr Pro Asn Glu Asn Asn Arg 

12 145 150 155 160 
13 

14 Pro Phe Ala Ala Ala Lys Glu He Val Pro Phe Tyr Phe Glu His Gly 

15 165 170 175 
16 

17 Pro Gin He Phe Asn Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp 

18 180 185 190 
19 

20 Gly Lys Tyr Leu Met Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg 

21 195 200 205 
22 

23 Val His Gin Ala Leu Thr Glu Val Val He Ser Ser Phe Asp He Lys 

24 2 1 0 2 1 5 2 2 0 
25 

26 Thr Asn Lys Pro Val He Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro 

27 225 230 235 240 
28 

29 Glu Leu Asp Ala Lys Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala 

30 245 250 255 
31 

32 Pro Thr Tyr Phe Pro Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly 

33 2 6 0 2 6 5 2 7 0 
34 

35 Asp Glu Tyr Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala 

36 275 280 285 
37 

38 Asp Pro Ala Leu Leu Ser He Ser Val Ala Thr Arg Leu Ala Gin Lys 

39 2 9 0 2 9 5 3 0 0 
40 

41 Asp Pro Ala Phe Ala Ser He Arg Ser Leu Asn Tyr Lys Lys Met Leu 

42 305 310 315 320 
43 

44 Leu Leu Ser Leu Gly Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr 

45 325 330 335 
46 

47 Thr Ala Lys Glu Ala Ala Thr Trp Thr Ala Val His Trp Met Leu Val 

48 3 4 0 3 4 5 3 5 0 
49 

50 He Gin Lys Met Thr Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr 

51 355 360 365 
52 

53 Leu Ser Thr Ala Phe Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg 

54 370 375 380 
55 

56 Val Gin Glu Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala 

57 3 8 5 3 9 0 3 9 5 4 0 0 
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2 Ser Glu Ala Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu 

3 405 410 415 
4 

5 Lys Lys Pro Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu 

6 420 425 430 
7 

8 Lys Arg Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn 

9 435 440 445 
10 

11 Lys Ala Ser Tyr 

12 450 
13 

14 <210> 7 

15 <211> 367 

16 <212> PRT 

17 <213> Synthetic 
18 

19 <400> 7 
20 

21 Glu Ala Glu Ala Gin Leu Gly Glu Met Val Thr Val Leu Ser He Asp 

22 1 5 10 15 
23 

24 Gly Gly Gly He Arg Gly He He Pro Ala Thr He Leu Glu Phe Leu 

25 20 25 30 
26 

27 Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala Asp Ala Arg Leu Ala 

28 35 40 45 
29 

30 Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly Leu Leu Thr 

31 50 55 60 
32 

33 Ala Met He Ser Thr Pro Asn Glu Asn Asn Arg Pro Phe Ala Ala Ala 

34 6 5 7 0 7 5 8 0 
35 

36 Lys Glu He Val Pro Phe Tyr Phe Glu His Gly Pro Gin He Phe Asn 

37 85 90 95 
38 

39 Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp Gly Lys Tyr Leu Met 

40 1 00 1 05 110 
41 

42 Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His Gin Ala Leu 

43 115 120 125 
44 

45 Thr Glu Val Val He Ser Ser Phe Asp He Lys Thr Asn Lys Pro Val 

46 130 135 140 
47 

48 He Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro Glu Leu Asp Ala Lys 

49 145 150 155 160 
50 

51 Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala Pro Thr Tyr Phe Pro 

52 1 65 1 70 1 75 
53 

54 Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly Asp Glu Tyr Glu Phe 

55 180 185 190 
56 

57 Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala Asp Pro Ala Leu Leu 
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Ser lie Ser Val Ala Thr Arg Leu Ala Gin Lys Asp Pro Ala Phe Ala 
210 215 220 

Ser lie Arg Ser Leu Asn Tyr Lys Lys Met Leu Leu Leu Ser Leu Gly 
225 230 235 240 

Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr Thr Ala Lys Glu Ala 
245 250 255 

Ala Thr Trp Thr Ala Val His Trp Met Leu Val He Gin Lys Met Thr 
260 265 270 

Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser Thr Ala Phe 
275 280 285 

Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin Glu Asn Ala 
290 295 300 

Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu Ala Asn Met 
305 310 315 320 

Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu Lys Lys Pro Val Ser 

325 330 335 

Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg Phe Ala Lys 
340 345 350 

Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala Ser Tyr 
355 360 365 

<210> 8 
<211> 24 
<212> DNA 
<213> Synthetic 

<400> 8 

atgttcgaag aaaaaaggta caat 

<210> 9 

<211> 24 

<212> DNA 

<213> Synthetic 

<400> 9 

ttgcataaga aattttccat cata 

<210> 10 

<211> 24 

<212> DNA 

<213> Synthetic 

<400> 10 

tgctgtggaa aaacttatgt cata 
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1 <211> 24 

2 <212> DNA 

3 <213> Synthetic 
4 

5 <400> 11 

6 cggaggaaaa aatgttggag ctgc 24 
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8 <210> 12 

9 <211> 51 

10 <212> DNA 

11 <213> Synthetic 
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14 atgcggagga aaaaatgttg gagctgctgc tgtggaaaaa cttatgtcat a 51 
15 
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17 <211> 24 
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19 <213> Synthetic 
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22 ttttgctgta aatgttttat caaa 24 
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24 <210> 14 

25 <211> 24 
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30 aaccctgagg aaattgtttt ttga 24 
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38 agcttcctca aaggtttcag gatt 24 
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41 <211> 10 

42 <212> PRT 

43 <213> Synthetic 
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45 <400> 15 
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47 Gin Leu Gly Glu Met Val Thr Val Leu Ser 

48 1 5 10 
49 

50 <210> 17 

51 <211> 10 

52 <212> PRT 

53 <213> Synthetic 
54 

55 <400> 17 
56 

57 Met Val Thr Val Leu Ser He Asp Gly Gly 
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Glu Gly Gin Leu Gin Glu Met Asp Asn Asn 



<210> 24 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 24 

Gin Glu Met Asp Asn Asn Ala Asp Ala Arg 
15 10 

<210> 25 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 25 

Asn Asn Ala Asp Ala Arg Leu Ala Asp Tyr 
15 10 

<210> 26 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 26 

Ala Arg Leu Ala Asp Tyr Phe Asp Val He 



<210> 27 
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<212> PRT 

<213> Synthetic 

<40Q> 27 

Asp Tyr Phe Asp Val He Gly Gly Thr Ser 
15 10 

<210> 28 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 28 

Val He Gly Gly Thr Ser Thr Gly Gly Leu 
1 5 10 

<210> 29 
<211> 10 
<212> PRT 
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<213> Synthetic 
<400> 29 

Thr Ser Thr Gly Gly Leu Leu Thr Ala Met 
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Gly Leu Leu Thr Ala Met lie Ser Thr Pro 
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<212> PRT 

<213> Synthetic 

<400> 33 

Asn Asn Arg Pro Phe Ala Ala Ala Lys Glu 
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<210> 34 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 34 

Phe Ala Ala Ala Lys Glu He Val Pro Phe 
1 5 10 
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<211> 10 
<212> PRT 
<213> Synthetic 

<400> 52 

Ser Asn Leu Ala Asn Ser Pro Glu Leu Asp 
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<210> 53 
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2 
3 



<211> 10 
<212> PRT 
<213> Synthetic 



<400> 109 



6 



7 Ala Phe Tyr Phe Glu His Gly Pro Gin lie 



9 

10 <210> 110 

11 <211> 10 

12 <212> PRT 

13 <213> Synthetic 
14 

15 <400> 110 
16 

17 Pro Ala Tyr Phe Glu His Gly Pro Gin He 

18 1 5 10 
19 

20 <210> 111 

21 <2H> 10 

22 <212> PRT 

23 <213> Synthetic 
24 

25 <400> 111 
26 

27 Pro Phe Ala Phe Glu His Gly Pro Gin He 

28 1 5 10 
29 

30 <210> 112 

31 <211> 10 

32 <212> PRT 

33 <213> Synthetic 
34 

35 <400> 112 
36 

37 Pro Phe Tyr Ala Glu His Gly Pro Gin He 

38 1 5 10 

39 

40 <210> 113 

41 <211> 10 

42 <212> PRT 

43 <213> Synthetic 
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45 <400> 113 
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47 Pro Phe Tyr Phe Ala His Gly Pro Gin He 

48 1 5 10 
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51 <211> 10 

52 <212> PRT 

53 <213> Synthetic 
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55 <400> 114 
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11 5 10 

2 

3 <210> 115 

4 <211> 10 

5 <212> PRT 

6 <213> Synthetic 
7 

8 <400> 115 
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10 Pro Phe Tyr Phe Glu His Ala Pro Gin lie 

11 1 5 10 
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13 <210> 116 

14 <211> 10 

15 <212> PRT 

16 <213> Synthetic 
17 

18 <400> 116 
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20 Pro Phe Tyr Phe Glu His Gly Ala Gin He 

21 1 5 10 
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23 <210> 117 

24 <211> 10 

25 <212> PRT 

26 <213> Synthetic 
27 

28 <400> 117 
29 

30 Pro Phe Tyr Phe Glu His Gly Pro Ala He 

31 1 5 10 
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33 <210> 118 

34 <211> 10 

35 <212> PRT 

36 <213> Synthetic 
37 

38 <400> 118 
39 

40 Pro Phe Tyr Phe Glu His Gly Pro Gin Ala 

41 1 5 10 
42 

43 <210> 119 

44 <211> 10 

45 <212> PRT 

46 <213> Synthetic 
47 

48 <400> 119 
49 

50 Thr Phe Tyr Leu Glu Asn Gly Pro Lys He 

51 1 5 10 
52 

53 <210> 120 

54 <211> 10 

55 <212> PRT 

56 <213> Synthetic 
57 
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<212> PRT 
<213> Synthetic 



<400> 126 



Lys Tyr Leu Met Gin Ala Leu Gin Glu Lys 
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Lys Tyr Leu Met Gin Val Ala Gin Glu Lys 
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<400> 160 

Pro Ala Phe Ala Ser He Arg Ser Leu Asn 
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<211> 10 

<212> PRT 

<213> Synthetic 

<400> 161 

Pro Asn Phe Asn Ala Gly Ser Pro Thr Glu 
1 5 10 

<210> 162 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 162 

Lys Met Leu Leu Leu Ser Leu Gly Thr Gly 
1 5 10 

<210> 163 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 163 

Asn Tyr Leu He He Ser Val Gly Thr Gly 
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<211> 10 

<212> PRT 

<213> Synthetic 

<400> 164 

Lys Met Leu Leu Leu Ser Leu Gly Ala Gly 
1 5 10 

<210> 165 

<211> 10 

<212> PRT 

<213> Synthetic 
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Ala Glu Phe Asp Lys Thr Tyr Thr Ala Lys 
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<210> 166 
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<211> 10 
<212> PRT 
<213> Synthetic 



Ser Ala Phe Asp Lys Thr Tyr Thr Ala Lys 
1 5 10 

<210> 167 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 167 

Ser Glu Ala Asp Lys Thr Tyr Thr Ala Lys 



<210> 168 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 168 

Ser Glu Phe Ala Lys Thr Tyr Thr Ala Lys 
15 10 
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<211> 10 

<212> PRT 

<213> Synthetic 

<400> 169 

Ser Glu Phe Asp Ala Thr Tyr Thr Ala Lys 
1 5 10 

<210> 170 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 170 

Ser Glu Phe Asp Lys Ala Tyr Thr Ala Lys 
1 5 10 

<210> 171 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 171 

Ser Glu Phe Asp Lys Thr Ala Thr Ala Lys 
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<400> 177 

Ala Glu Lys Tyr Thr Ala Glu Gin Cys Ala 
1 5 10 

<210> 178 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 178 

Ala Thr Tyr Thr Ala Lys Glu Ala Ala Thr 
1 5 10 

<210> 179 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 179 

Lys Ala Tyr Thr Ala Lys Glu Ala Ala Thr 
1 5 10 

<210> 180 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 180 

Lys Thr Ala Thr Ala Lys Glu Ala Ala Thr 
1 5 10 

<210> 181 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 181 

Lys Thr Tyr Ala Ala Lys Glu Ala Ala Thr 
1 5 10 

<210> 182 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 182 

Lys Thr Tyr Thr Ala Ala Glu Ala Ala Thr 
1 5 10 

<210> 183 
<211> 10 
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<212> PRT 
<213> Synthetic 



Lys Thr Tyr Thr Ala Lys Ala Ala Ala Thr 
1 5 10 

<210> 184 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 184 

Lys Thr Tyr Thr Ala Lys Glu Ala Ala Ala 
15 10 

<210> 185 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 185 

Glu Lys Tyr Thr Ala Glu Gin Cys Ala Lys 
15 10 

<210> 186 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 186 

Ala Ala Phe Ala Ala Ala Glu Ala Ala Thr 



<210> 187 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 187 

Lys Thr Phe Thr Ala Lys Glu Ala Ala Thr 
15 10 

<210> 188 

<211> 10 

<212> PRT 

<213> Synthetic 



<400> 188 



Gin Ala Leu His Cys Glu Lys Lys Tyr Leu 
15 10 
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<210> 189 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 189 

Gin Ala Leu Asp Ser Lys Ala Ala Tyr Leu 
1 5 10 

<210> 190 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 190 

Gin Ala Leu Asp Ser Lys Asn Asn Phe Leu 
1 5 10 

<210> 191 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 191 

Gin Ala Leu His Cys Glu Asn Asn Phe Leu 
1 5 10 

<210> 192 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 192 

Cys Glu Lys Lys Tyr Leu Arg He Gin Asp 
1 5 10 

<210> 193 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 193 

Ser Lys Asn Asn Phe Leu Arg Val Gin Glu 
1 5 10 

<210> 194 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 194 
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Ser Glu Asn Asn Tyr Leu Arg Val Gin Glu 
1 5 10 

<210> 195 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 195 

Ala Leu Arg Val Gin Glu Asn Ala Leu Thr 
1 5 10 

<210> 196 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 196 

Tyr Ala Arg Val Gin Glu Asn Ala Leu Thr 
1 5 10 

<210> 197 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 197 

Tyr Leu Ala Val Gin Glu Asn Ala Leu Thr 
1 5 10 

<210> 198 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 198 

Tyr Leu Arg Ala Gin Glu Asn Ala Leu Thr 
1 5 10 

<210> 199 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 199 

Tyr Leu Arg Val Ala Glu Asn Ala Leu Thr 
15 10 

<210> 200 
<211> 10 
<212> PRT 
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<213> Synthetic 



3 


<400> 


200 












4 

5 


Tyr Leu Arg Val Gin 


Ala 


Asn 


Ala 


Leu 


Thr 


6 


1 


5 










10 


7 
8 


<210> 


201 












9 


<211> 


10 












10 


<212> 


PRT 












11 


<213> 


Synthetic 












12 
















13 


<400> 


201 












14 
















15 


Tyr Leu Arg Val Gin 


Glu 


Ala 


Ala 


Leu 


Thr 


16 


1 


5 










10 


17 
















18 


<210> 


202 












19 


<211> 


10 












20 


<212> 


PRT 












21 


<213> 


Synthetic 












22 
















23 


<400> 


202 












24 
















25 


Tyr Leu Arg Val Gin 


Glu 


Asn 


Ala 


Ala 


Thr 


26 


1 


5 










10 


27 
















28 


<210> 


203 












29 


<211> 


10 












30 


<212> 


PRT 












31 


<213> 


Synthetic 












32 
















33 


<400> 


203 












34 
















35 


Tyr Leu Arg Val Gin 


Glu 


Asn 


Ala 


Leu 


Ala 


36 


1 


5 










10 


37 
















38 


<210> 


204 












39 


<211> 


10 












40 


<212> 


PRT 












41 


<213> 


Synthetic 












42 
















43 


<400> 


204 












44 
















45 


Tyr Leu Arg lie Gin Asp Asp 


Thr 


Leu 


Thr 


46 


1 


5 










10 


47 
















48 


<210> 


205 












49 


<211> 


10 












50 


<212> 


PRT 












51 


<213> 


Synthetic 












52 
















53 


<400> 


205 












54 
















55 


Tyr Leu Thr Val Ala 


Ala 


Ala 


Ala 


Leu 


Thr 


56 


1 


5 










10 


57 
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1 


<210> 


206 




<211> 


10 


3 


<212> 


PRT 




<213> 


Synthetic 


5 
6 


<400> 


206 




Phe Leu Arg Val ( 


9 


1 




10 






1 1 


<210> 


207 


12 


<211> 


10 


13 


<212> 


PRT 


14 


<213> 


Synthetic 


15 






16 


<400> 


207 


17 






18 


Asn AST 


1 Tyr Leu I 


19 


1 




20 






21 


<210> 


208 


22 


<211> 


10 


23 


<212> 


PRT 


24 


<213> 


Synthetic 


25 






26 


<400> 


208 


27 






28 


Lys Lys Tyr Leu } 


29 


1 




30 






31 


<210> 


209 


32 


<211> 


10 


33 


<212> 


PRT 


34 


<213> 


Synthetic 


35 






36 


<400> 


209 


37 






38 


Asn Asn Phe Leu 1 


39 


1 




40 






41 


<210> 


210 


42 


<211> 


10 


43 


<212> 


PRT 


44 


<213> 


Synthetic 


45 






46 


<400> 


210 


47 






48 


Asn Al£ 


I Tyr Leu } 


49 


1 




50 








<210> 


211 


52 


<211> 


10 


53 


<212> 


PRT 


54 


<213> 


Synthetic 


55 






56 


<400> 


211 


57 
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Ala Thr Tyr Glu Glu Ala Lys Leu Arg Phe 

10 



2 


1 


5 


3 






4 


<210> 


212 




<211> 


10 


6 


<212> 


PRT 


7 


<213> 


Synthetic 


8 
9 


<400> 


212 


10 






1 1 


Glu Ala Tyr Glu Glu 


12 


1 


5 


13 






14 


<210> 


213 


15 


<211> 


10 


16 


<212> 


PRT 


17 


<213> 


Synthetic 


18 






19 


<400> 


213 


20 






21 


Glu Thr Ala Glu Glu 


22 


1 


5 


23 






24 


<210> 


214 


25 


<211> 


10 


26 


<212> 


PRT 


27 


<213> 


Synthetic 


28 






29 


<400> 


214 


30 






31 


Glu Thr Tyr Ala Glu 


32 


1 


5 


33 






34 


<210> 


215 


35 


<211> 


10 


36 


<212> 


PRT 


37 


<213> 


Synthetic 


38 






39 


<400> 


215 


40 






41 


Glu Thr Tyr Glu Ala 


42 


1 


5 


43 






44 


<210> 


216 


45 


<211> 


10 


46 


<212> 


PRT 


47 


<213> 


Synthetic 


48 






49 


<400> 


216 


50 






51 


Glu Thr Tyr Glu Glu 


52 


1 


5 


53 






54 


<210> 


217 


55 


<211> 


10 


56 


<212> 


PRT 


57 


<213> 


Synthetic 
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<400> 217 

Glu Thr Tyr Glu Glu Ala Leu Ala Arg Phe 
1 5 10 

<210> 218 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 218 

Glu Thr Tyr Glu Glu Ala Leu Lys Ala Phe 
1 5 10 

<210> 219 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 219 

Glu Thr Tyr Glu Glu Ala Leu Lys Arg Ala 
1 5 10 

<210> 220 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 220 

Gly Thr Asn Ala Gin Ser Leu Ala Asp Phe 



<210> 221 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 221 

Glu Thr Tyr Glu Ala Ala Leu Ala Ala Phe 
1 5 10 

<210> 222 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 222 

Glu Thr Phe Glu Glu Ala Leu Lys Arg Phe 
1 5 10 

<210> 223 
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<211> 10 
<212> PRT 
<213> Synthetic 

<400> 223 

Tyr Glu Glu Ala Leu Lys Thr Phe Ala Lys 
15 10 

<210> 224 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 224 

Phe Glu Glu Ala Leu Lys Arg Phe Ala Lys 
1 5 10 

<210> 225 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 225 

Ala Ala Leu Lys Arg Phe Ala Lys Leu Leu 
15 10 

<210> 226 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 226 

Glu Ala Ala Lys Arg Phe Ala Lys Leu Leu 
15 10 

<210> 227 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 227 

Glu Ala Leu Ala Arg Phe Ala Lys Leu Leu 
15 10 

<210> 228 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 228 

Glu Ala Leu Lys Ala Phe Ala Lys Leu Leu 
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1 


1 








229 




<211> 


10 




<212> 


PRT 




<213> 


Synthetic 




<400> 


229 










Glu Ala Leu Lys ; 




1 












<210> 


230 


14 


<211> 


10 




<212> 


PRT 


16 


<213> 


Synthetic 








18 


<400> 


230 


19 
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Glu Ala Leu Lys i 


^' 


1 




22 






23 


<210> 


231 


24 


<211> 
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25 


<212> 


PRT 


26 


<213> 


Synthetic 


27 
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29 






30 


Glu Ala Leu Lys ] 


31 


1 




32 








<210> 


232 


34 


<211> 


10 




<212> 


PRT 


36 


<213> 


Synthetic 


37 
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<400> 


232 


39 
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Glu Ala Leu Lys 1 


41 


1 




42 






43 


<210> 


233 


44 


<211> 


10 


45 


<212> 


PRT 


46 


<213> 


Synthetic 


47 
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Gin Ser Leu Ala 1 




1 




52 






53 


<210> 
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54 


<211> 
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55 


<212> 


PRT 


56 


<213> 


Synthetic 


57 
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3 


Ala Ale 


I Leu Ala Ala 




1 


5 


6 


<210> 


235 


7 


<211> 


10 


8 


<212> 


PRT 


9 


<213> 


Synthetic 


10 






II 


<4 00> 


235 


12 






13 


Leu Ala 


L Asp Phe Ala 


14 


1 


5 


15 






16 


<210> 


236 


17 


<211> 


10 


18 


<212> 


PRT 


19 


<213> 


Synthetic 


20 
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22 






23 


Asp Ph€ 


i Ala Lys Gin 


24 
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5 


25 






26 


<210> 


237 


27 


<211> 


10 


28 


<212> 


PRT 


29 


<213> 


Synthetic 


30 






31 


<400> 


237 


32 






33 


Ala Phe 


; Ala Ala Leu 


34 


1 


5 


35 






36 


<210> 


238 


37 


<211> 


10 


38 


<212> 


PRT 


39 


<213> 


Synthetic 


40 






41 


<400> 


238 


42 






43 


Ser Thi 


: Ala Ala Ala 


44 


1 


5 


45 






46 


<210> 


239 


47 


<211> 


10 


48 


<212> 


PRT 


49 


<213> 


Synthetic 


50 






51 


<400> 


239 


52 






53 


Leu Lys 


1 Arg Phe Ala 


54 


1 


5 


55 






56 


<210> 


240 


57 


<211> 


10 



Arg Lys 
10 
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<212> PRT 
<213> Synthetic 

<400> 240 

Gin Ala Leu Asp Ser Glu Asn Asn Phe Leu 



<210> 241 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 241 

Ser Asp Leu Ala Asp Phe Ala Lys Gin Leu 
15 10 

<210> 242 

<211> 55 

<212> DNA 

<213> Synthetic 



<400> 242 

ggagctcgag aaaagagagg ctgaagcttc attgaattac aaaaaaatgc tgttg 



<210> 243 

<211> 42 

<212> DNA 

<213> Synthetic 

<400> 243 

tcccaactgt cctggtccat aagaagcttt gtttgctcgg ag 



<210> 244 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 244 

gcttcttatg gaccaggaca gttgggagaa atggtg 

<210> 245 

<211> 39 

<212> DNA 

<213> Synthetic 

<400> 245 

ggtctagagg aattctcatt acctaattga agcaaatgc 

<210> 246 

<211> 1128 

<212> DNA 

<213> Synthetic 
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<400> 246 

tcgagaaaag agaggctgaa gcttcattga attacaaaaa aatgctgttg ctctcattag 
gcactggcac tacttcagag tttgataaaa catatacagc aaaagaggca gctacctgga 
ctgctgtaca ttggatgtta gttatacaga aaatgactga tgcagcaagt tcttacatga 
ctgattatta cctttctact gcttttcaag ctcttgattc aaaaaacaat tacctcaggg 
ttcaagaaaa tgcattaaca ggcacaacta ctgaaatgga tgatgcttct gaggctaata 
tggaattatt agtacaagtt ggtgaaaact tattgaagaa accagtttcc gaagacaatc 
ctgaaaccta tgaggaagct ctaaagaggt ttgcaaaatt gctctctgat aggaagaaac 
tccgagcaaa caaagcttct tatggaccag gacagttggg agaaatggtg actgttctta 
gtattgatgg aggtggaatt agagggatca ttccggctac cattctcgaa tttcttgaag 
gacaacttca ggaaatggac aataatgcag atgcaagact tgcagattac tttgatgtaa 
ttggaggaac aagtacagga ggtttattga ctgctatgat aagtactcca aatgaaaaca 
atcgaccctt tgctgctgcc aaagaaattg taccttttta cttcgaacat ggccctcaga 
tttttaatcc tagtggtcaa attttaggcc caaaatatga tggaaaatat cttatgcaag 
ttcttcaaga aaaacttgga gaaactcgtg tgcatcaagc tttgacagaa gttgtcatct 
caagctttga catcaaaaca aataagccag taatattcac taagtcaaat ttagcaaact 
ctccagaatt ggatgctaag atgtatgaca taagttattc cacagcagca gctccaacat 
attttcctcc gcattacttt gttactaata ctagtaatgg agatgaatat gagttcaatc 
ttgttgatgg tgctgttgct actgttgctg atccggcgtt attatccatt agcgttgcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



cgagacttgc acaaaaggat ccagcatttg cttcaattag gtaatgag 1128 

<210> 247 

<211> 366 

<212> PRT 

<213> Synthetic 

<400> 247 

Ser Leu Asn Tyr Lys Lys Met Leu Leu Leu Ser Leu Gly Thr Gly Thr 
15 10 15 

Thr Ser Glu Phe Asp Lys Thr Tyr Thr Ala Lys Glu Ala Ala Thr Trp 
20 25 30 

Thr Ala Val His Trp Met Leu Val He Gin Lys Met Thr Asp Ala Ala 
35 40 45 

Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser Thr Ala Phe Gin Ala Leu 

-135- of -213- 
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Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin Glu Asn Ala Leu Thr Gly 
65 70 75 80 

Thr Thr Thr Glu Met Asp Asp Ala Ser Glu Ala Asn Met Glu Leu Leu 
85 90 95 

Val Gin Val Gly Glu Asn Leu Leu Lys Lys Pro Val Ser Glu Asp Asn 
100 105 110 

Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg Phe Ala Lys Leu Leu Ser 
115 120 125 

Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala Ser Tyr Gly Pro Gly Gin 
130 135 140 

Leu Gly Glu Met Val Thr Val Leu Ser He Asp Gly Gly Gly He Arg 
145 150 155 160 

Gly He He Pro Ala Thr He Leu Glu Phe Leu Glu Gly Gin Leu Gin 
165 170 175 

Glu Met Asp Asn Asn Ala Asp Ala Arg Leu Ala Asp Tyr Phe Asp Val 
180 185 190 

He Gly Gly Thr Ser Thr Gly Gly Leu Leu Thr Ala Met He Ser Thr 
195 200 205 

Pro Asn Glu Asn Asn Arg Pro Phe Ala Ala Ala Lys Glu He Val Pro 
210 215 220 

Phe Tyr Phe Glu His Gly Pro Gin He Phe Asn Pro Ser Gly Gin He 
225 230 235 240 

Leu Gly Pro Lys Tyr Asp Gly Lys Tyr Leu Met Gin Val Leu Gin Glu 
245 250 255 

Lys Leu Gly Glu Thr Arg Val His Gin Ala Leu Thr Glu Val Val He 
260 265 270 

Ser Ser Phe Asp He Lys Thr Asn Lys Pro Val He Phe Thr Lys Ser 
275 280 285 

Asn Leu Ala Asn Ser Pro Glu Leu Asp Ala Lys Met Tyr Asp He Ser 
290 295 300 

Tyr Ser Thr Ala Ala Ala Pro Thr Tyr Phe Pro Pro His Tyr Phe Val 
305 310 315 320 

Thr Asn Thr Ser Asn Gly Asp Glu Tyr Glu Phe Asn Leu Val Asp Gly 
325 330 335 

Ala Val Ala Thr Val Ala Asp Pro Ala Leu Leu Ser He Ser Val Ala 
340 345 350 



Thr Arg Leu Ala Gin Lys Asp Pro Ala Phe Ala Ser He Arg 
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355 
<210> 248 
<211> 55 
<212> DNA 
<213> Synthetic 

<400> 248 



ggagctcgag 


aaaagagagg 


ctgaagctaa 


tactagtaat 


ggagatgaat 


atgag 


55 


<210> 249 
<211> 39 
<212> DNA 
<213> Synthetic 












<400> 249 
ggtctagagg 


aattctcatt 


aagtaacaaa 


gtaatgcgg 






39 


<210> 250 
<211> 1128 
<212> DNA 
<213> Synthetic 












<400> 250 
tcgagaaaag 


agaggctgaa 


gctaatacta 


gtaatggaga 


tgaatatgag 


ttcaatcttg 


60 


ttgatggtgc 


tgttgctact 


gttgctgatc 


cggcgttatt 


atccattagc 


gttgcaacga 


120 


gacttgcaca 


aaaggatcca 


gcatttgctt 


caattaggtc 


attgaattac 


aaaaaaatgc 


180 


tgttgctctc 


attaggcact 


ggcactactt 


cagagtttga 


taaaacatat 


acagcaaaag 


240 


aggcagctac 


ctggactgct 


gtacattgga 


tgttagttat 


acagaaaatg 


actgatgcag 


300 


caagttctta 


catgactgat 


tattaccttt 


ctactgcttt 


tcaagctctt 


gattcaaaaa 


360 


acaattacct 


cagggttcaa 


gaaaatgcat 


taacaggcac 


aactactgaa 


atggatgatg 


420 


cttctgaggc 


taatatggaa 


ttattagtac 


aagttggtga 


aaacttattg 


aagaaaccag 


480 


tttccgaaga 


caatcctgaa 


acctatgagg 


aagctctaaa 


gaggtttgca 


aaattgctct 


540 


ctgataggaa 


gaaactccga 


gcaaacaaag 


cttcttatgg 


accaggacag 


ttgggagaaa 


600 


tggtgactgt 


tcttagtatt 


gatggaggtg 


gaattagagg 


gatcattccg 


gctaccattc 


660 


tcgaatttct 


tgaaggacaa 


cttcaggaaa 


tggacaataa 


tgcagatgca 


agacttgcag 


720 


attactttga 


tgtaattgga 


ggaacaagta 


caggaggttt 


attgactgct 


atgataagta 


780 


ctccaaatga 


aaacaatcga 


ccctttgctg 


ctgccaaaga 


aattgtacct 


ttttacttcg 


840 


aacatggccc 


tcagattttt 


aatcctagtg 


gtcaaatttt 


aggcccaaaa 


tatgatggaa 


900 


aatatcttat 


gcaagttctt 


caagaaaaac 


ttggagaaac 


tcgtgtgcat 


caagctttga 


960 


cagaagttgt 


catctcaagc 


tttgacatca 


aaacaaataa 


gccagtaata 


ttcactaagt 


1020 
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I caaatttagc aaactctcca gaattggatg ctaagatgta tgacataagt tattccacag 1080 
2 

3 cagcagctcc aacatatttt cctccgcatt actttgttac ttaatgag 1128 

5 <210> 251 

6 <211> 366 

7 <212> PRT 

8 <213> Synthetic 
9 

10 <400> 251 
11 

12 Asn Thr Ser Asn Gly Asp Glu Tyr Glu Phe Asn Leu Val Asp Gly Ala 

13 1 5 10 15 
14 

15 Val Ala Thr Val Ala Asp Pro Ala Leu Leu Ser He Ser Val Ala Thr 

16 20 25 30 
17 

18 Arg Leu Ala Gin Lys Asp Pro Ala Phe Ala Ser He Arg Ser Leu Asn 

19 35 40 45 
20 

21 Tyr Lys Lys Met Leu Leu Leu Ser Leu Gly Thr Gly Thr Thr Ser Glu 

22 5 0 5 5 6 0 
23 

24 Phe Asp Lys Thr Tyr Thr Ala Lys Glu Ala Ala Thr Trp Thr Ala Val 

25 65 70 75 80 
26 

27 His Trp Met Leu Val He Gin Lys Met Thr Asp Ala Ala Ser Ser Tyr 

28 85 90 95 
29 

30 Met Thr Asp Tyr Tyr Leu Ser Thr Ala Phe Gin Ala Leu Asp Ser Lys 

31 100 105 110 
32 

33 Asn Asn Tyr Leu Arg Val Gin Glu Asn Ala Leu Thr Gly Thr Thr Thr 

34 115 1 20 1 25 
35 

36 Glu Met Asp Asp Ala Ser Glu Ala Asn Met Glu Leu Leu Val Gin Val 

37 130 135 140 
38 

39 Gly Glu Asn Leu Leu Lys Lys Pro Val Ser Glu Asp Asn Pro Glu Thr 

40 145 150 155 160 
41 

42 Tyr Glu Glu Ala Leu Lys Arg Phe Ala Lys Leu Leu Ser Asp Arg Lys 

43 165 170 175 
44 

45 Lys Leu Arg Ala Asn Lys Ala Ser Tyr Gly Pro Gly Gin Leu Gly Glu 

46 180 185 190 
47 

48 Met Val Thr Val Leu Ser He Asp Gly Gly Gly He Arg Gly He He 

49 195 200 205 
50 

51 Pro Ala Thr He Leu Glu Phe Leu Glu Gly Gin Leu Gin Glu Met Asp 

52 210 215 220 
53 

54 Asn Asn Ala Asp Ala Arg Leu Ala Asp Tyr Phe Asp Val He Gly Gly 

55 2 2 5 2 3 0 2 3 5 2 4 0 
56 

57 Thr Ser Thr Gly Gly Leu Leu Thr Ala Met He Ser Thr Pro Asn Glu 
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Asn Asn Arg Pro Phe Ala Ala Ala Lys Glu lie Val Pro Phe Tyr Phe 
260 265 270 

Glu His Gly Pro Gin lie Phe Asn Pro Ser Gly Gin lie Leu Gly Pro 
275 280 285 

Lys Tyr Asp Gly Lys Tyr Leu Met Gin Val Leu Gin Glu Lys Leu Gly 
290 295 300 

Glu Thr Arg Val His Gin Ala Leu Thr Glu Val Val lie Ser Ser Phe 
305 310 315 320 

Asp lie Lys Thr Asn Lys Pro Val lie Phe Thr Lys Ser Asn Leu Ala 
325 330 335 

Asn Ser Pro Glu Leu Asp Ala Lys Met Tyr Asp lie Ser Tyr Ser Thr 
340 345 350 

Ala Ala Ala Pro Thr Tyr Phe Pro Pro His Tyr Phe Val Thr 
355 360 365 

<210> 252 

<211> 55 

<212> DNA 

<213> Synthetic 

<400> 252 

ggagctcgag aaaagagagg ctgaagctag ttattccaca gcagcagctc caaca 55 

<210> 253 

<211> 39 

<212> DNA 

<213> Synthetic 

<400> 253 

ggtctagagg aattctcatt atatgtcata catcttagc 39 

<210> 254 

<211> 1128 

<212> DNA 

<213> Synthetic 

<400> 254 

tcgagaaaag agaggctgaa gctagttatt ccacagcagc agctccaaca tattttcctc 60 

cgcattactt tgttactaat actagtaatg gagatgaata tgagttcaat cttgttgatg 120 

gtgctgttgc tactgttgct gatccggcgt tattatccat tagcgttgca acgagacttg 180 

cacaaaagga tccagcattt gcttcaatta ggtcattgaa ttacaaaaaa atgctgttgc 240 

tctcattagg cactggcact acttcagagt ttgataaaac atatacagca aaagaggcag 300 

ctacctggac tgctgtacat tggatgttag ttatacagaa aatgactgat gcagcaagtt 360 
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1 cttacatgac tgattattac ctttctactg cttttcaagc tcttgattca aaaaacaatt 420 
2 

3 acctcagggt tcaagaaaat gcattaacag gcacaactac tgaaatggat gatgcttctg 480 
4 

5 aggctaatat ggaattatta gtacaagttg gtgaaaactt attgaagaaa ccagtttccg 540 
6 

7 aagacaatcc tgaaacctat gaggaagctc taaagaggtt tgcaaaattg ctctctgata 600 
8 

9 ggaagaaact ccgagcaaac aaagcttctt atggaccagg acagttggga gaaatggtga 660 
10 

11 ctgttcttag tattgatgga ggtggaatta gagggatcat tccggctacc attctcgaat 720 
12 

13 ttcttgaagg acaacttcag gaaatggaca ataatgcaga tgcaagactt gcagattact 780 

15 ttgatgtaat tggaggaaca agtacaggag gtttattgac tgctatgata agtactccaa 840 
16 

17 atgaaaacaa tcgacccttt gctgctgcca aagaaattgt acctttttac ttcgaacatg 900 
18 

19 gccctcagat ttttaatcct agtggtcaaa ttttaggccc aaaatatgat ggaaaatatc 960 
20 

21 ttatgcaagt tcttcaagaa aaacttggag aaactcgtgt gcatcaagct ttgacagaag 1020 
22 

23 ttgtcatctc aagctttgac atcaaaacaa ataagccagt aatattcact aagtcaaatt 1080 
24 

25 tagcaaactc tccagaattg gatgctaaga tgtatgacat ataatgag 1128 
26 

27 <210> 255 

28 <211> 366 

29 <212> PRT 

30 <213> Synthetic 
31 

32 <400> 255 
33 

34 Ser Tyr Ser Thr Ala Ala Ala Pro Thr Tyr Phe Pro Pro His Tyr Phe 

35 1 5 10 15 
36 

37 Val Thr Asn Thr Ser Asn Gly Asp Glu Tyr Glu Phe Asn Leu Val Asp 

38 2 0 2 5 3 0 
39 

40 Gly Ala Val Ala Thr Val Ala Asp Pro Ala Leu Leu Ser He Ser Val 

41 35 40 45 
42 

43 Ala Thr Arg Leu Ala Gin Lys Asp Pro Ala Phe Ala Ser He Arg Ser 

44 5 0 5 5 6 0 
45 

46 Leu Asn Tyr Lys Lys Met Leu Leu Leu Ser Leu Gly Thr Gly Thr Thr 

47 6 5 7 0 7 5 8 0 
48 

49 Ser Glu Phe Asp Lys Thr Tyr Thr Ala Lys Glu Ala Ala Thr Trp Thr 

50 85 90 95 
51 

52 Ala Val His Trp Met Leu Val He Gin Lys Met Thr Asp Ala Ala Ser 

53 100 105 110 
54 

55 Ser Tyr Met Thr Asp Tyr Tyr Leu Ser Thr Ala Phe Gin Ala Leu Asp 

56 115 120 125 
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Ser Lys Asn Asn Tyr Leu Arg Val Gin Glu Asn Ala Leu Thr Gly Thr 
130 135 140 

Thr Thr Glu Met Asp Asp Ala Ser Glu Ala Asn Met Glu Leu Leu Val 
145 150 155 160 

Gin Val Gly Glu Asn Leu Leu Lys Lys Pro Val Ser Glu Asp Asn Pro 
165 170 175 

Glu Thr Tyr Glu Glu Ala Leu Lys Arg Phe Ala Lys Leu Leu Ser Asp 
180 185 190 

Arg Lys Lys Leu Arg Ala Asn Lys Ala Ser Tyr Gly Pro Gly Gin Leu 
195 200 205 

Gly Glu Met Val Thr Val Leu Ser He Asp Gly Gly Gly He Arg Gly 
210 215 220 

He He Pro Ala Thr He Leu Glu Phe Leu Glu Gly Gin Leu Gin Glu 
225 230 235 240 

Met Asp Asn Asn Ala Asp Ala Arg Leu Ala Asp Tyr Phe Asp Val He 
245 250 255 

Gly Gly Thr Ser Thr Gly Gly Leu Leu Thr Ala Met He Ser Thr Pro 
260 265 270 

Asn Glu Asn Asn Arg Pro Phe Ala Ala Ala Lys Glu He Val Pro Phe 
275 280 285 

Tyr Phe Glu His Gly Pro Gin He Phe Asn Pro Ser Gly Gin He Leu 
290 295 300 

Gly Pro Lys Tyr Asp Gly Lys Tyr Leu Met Gin Val Leu Gin Glu Lys 
305 310 315 320 

Leu Gly Glu Thr Arg Val His Gin Ala Leu Thr Glu Val Val He Ser 
325 330 335 

Ser Phe Asp He Lys Thr Asn Lys Pro Val He Phe Thr Lys Ser Asn 
340 345 350 

Leu Ala Asn Ser Pro Glu Leu Asp Ala Lys Met Tyr Asp He 
355 360 365 

<210> 256 

<211> 55 

<212> DNA 

<213> Synthetic 

<400> 256 

ggagctcgag aaaagagagg ctgaagctac atatacagca aaagaggcag ctacc 

<210> 257 

<211> 39 

<212> DNA 

<213> Synthetic 
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<400> 257 

ggtctagagg aattctcatt atttatcaaa ctctgaagt 39 

<210> 258 

<211> 1128 

<212> DNA 

<213> Synthetic 

<400> 258 

tcgagaaaag agaggctgaa gctacatata cagcaaaaga ggcagctacc tggactgctg 60 

tacattggat gttagttata cagaaaatga ctgatgcagc aagttcttac atgactgatt 120 

attacctttc tactgctttt caagctcttg attcaaaaaa caattacctc agggttcaag 180 

aaaatgcatt aacaggcaca actactgaaa tggatgatgc ttctgaggct aatatggaat 240 

tattagtaca agttggtgaa aacttattga agaaaccagt ttccgaagac aatcctgaaa 300 

cctatgagga agctctaaag aggtttgcaa aattgctctc tgataggaag aaactccgat 350 

caaacaaagc ttcttatgga ccaggacagt tgggagaaat ggtgactgtt cttagtattg 420 

atggaggtgg aattagaggg atcattccgg ctaccattct cgaatttctt gaaggacaac 480 

ttcaggaaat ggacaataat gcagatgcaa gacttgcaga ttactttgat gtaattggag 540 

gaacaagtac aggaggttta ttgactgcta tgataagtac tccaaatgaa aacaatcgac 600 

cctttgctgc tgccaaagaa attgtacctt tttacttcga acatggccct cagattttta 560 

atcctagtgg tcaaatttta ggcccaaaat atgatggaaa atatcttatg caagttcttc 720 

aagaaaaact tggagaaact cgtgtgcatc aagctttgac agaagttgtc atctcaagct 780 

ttgacatcaa aacaaataag ccagtaatat tcactaagtc aaatttagca aactctccag 840 

aattggatgc taagatgtat gacataagtt attccacagc agcagctcca acatattttc 900 

ctccgcatta ctttgttact aatactagta atggagatga atatgagttc aatcttgttg 960 

atggtgctgt tgctactgtt gctgatccgg cgttattatc cattagcgtt gcaacgagac 1020 

ttgcacaaaa ggatccagca tttgcttcaa ttaggtcatt gaattacaaa aaaatgctgt 1080 

tgctctcatt aggcactggc actacttcag agtttgataa ataatgag 1128 

<210> 259 

<211> 366 

<212> PRT 

<213> Synthetic 

<400> 259 

Thr Tyr Thr Ala Lys Glu Ala Ala Thr Trp Thr Ala Val His Trp Met 
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Leu Val He Gin Lys Met Thr Asp Ala Ala Ser Ser Tyr Met Thr i 
20 25 30 



Tyr Tyr Leu Ser Thr Ala Phe Gin Ala Leu Asp Ser Lys Asn Asn Tyr 



Leu Arg Val Gin Glu Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp 
50 55 50 



Asp Ala Ser Glu Ala Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn 
65 70 75 80 



Leu Leu Lys Lys Pro Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu 



Ala Leu Lys Arg Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg 
100 105 110 



Ser Asn Lys Ala Ser Tyr Gly Pro Gly Gin Leu Gly Glu Met Val Thr 
115 120 125 



Val Leu Ser He Asp Gly Gly Gly He Arg Gly He He Pro Ala Thr 
130 135 140 



He Leu Glu Phe Leu Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala 
145 150 155 160 



Asp Ala Arg Leu Ala Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr 
165 170 175 



Gly Gly Leu Leu Thr Ala Met He Ser Thr Pro Asn Glu Asn Asn Arg 
180 185 190 



Pro Phe Ala Ala Ala Lys Glu He Val Pro Phe Tyr Phe Glu His Gly 
195 200 205 



Pro Gin He Phe Asn Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp 
210 215 220 



Gly Lys Tyr Leu Met Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg 
225 230 235 240 



Val His Gin Ala Leu Thr Glu Val Val He Ser Ser Phe Asp He Lys 
245 250 255 



Thr Asn Lys Pro Val He Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro 
260 265 270 



Glu Leu Asp Ala Lys Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala 
275 280 285 



Pro Thr Tyr Phe Pro Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly 
290 295 300 



Asp Glu Tyr Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala 
305 310 315 320 
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Asp Pro Ala Leu Leu Ser lie Ser Val Ala Thr Arg Leu Ala Gin Lys 
325 330 335 

Asp Pro Ala Phe Ala Ser lie Arg Ser Leu Asn Tyr Lys Lys Met Leu 
340 345 350 

Leu Leu Ser Leu Gly Thr Gly Thr Thr Ser Glu Phe Asp Lys 
355 360 365 

<210> 260 

<211> 55 

<212> DNA 

<213> Synthetic 

<400> 260 

ggagctcgag aaaagagagg ctgaagctaa tgcattaaca ggcacaacta ctgaa 55 

<210> 261 

<211> 39 

<212> DNA 

<213> Synthetic 

<400> 261 

ggtctagagg aattctcatt attcttgaac cctgaggta 39 

<210> 262 

<211> 1128 

<212> DNA 

<213> Synthetic 

<400> 262 

tcgagaaaag agaggctgaa gctaatgcat taacaggcac aactactgaa atggatgatg 60 

cttctgaggc taatatggaa ttattagtac aagttggtga aaacttattg aagaaaccag 120 

tttccgaaga caatcctgaa acctatgagg aagctctaaa gaggtttgca aaattgctct 180 

ctgataggaa gaaactccga gcaaacaaag cttcttatgg accaggacag ttgggagaaa 240 

tggtgactgt tcttagtatt gatggaggtg gaattagagg gatcattccg gctaccattc 300 

tcgaatttct tgaaggacaa cttcaggaaa tggacaataa tgcagatgca agacttgcag 360 

attactttga tgtaattgga ggaacaagta caggaggttt attgactgct atgataagta 420 

ctccaaatga aaacaatcga ccctttgctg ctgccaaaga aattgtacct ttttacttcg 480 

aacatggccc tcagattttt aatcctagtg gtcaaatttt aggcccaaaa tatgatggaa 540 

aatatcttat gcaagttctt caagaaaaac ttggagaaac tcgtgtgcat caagctttga 600 

cagaagttgt catctcaagc tttgacatca aaacaaataa gccagtaata ttcactaagt 660 

caaatttagc aaactctcca gaattggatg ctaagatgta tgacataagt tattccacag 720 

cagcagctcc aacatatttt cctccgcatt actttgttac taatactagt aatggagatg 780 
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2 aatatgagtt caatcttgtt gatggtgctg ttgctactgt tgctgatccg gcgttattat 840 
3 

4 ccattagcgt tgcaacgaga cttgcacaaa aggatccagc atttgcttca attaggtcat 900 
5 

6 tgaattacaa aaaaatgctg ttgctctcat taggcactgg cactacttca gagtttgata 960 
7 

8 aaacatatac agcaaaagag gcagctacct ggactgctgt acattggatg ttagttatac 1020 
9 

10 agaaaatgac tgatgcagca agttcttaca tgactgatta ttacctttct actgcttttc 1080 
11 

12 aagctcttga ttcaaaaaac aattacctca gggttcaaga ataatgag 1128 
13 

14 <210> 263 

15 <211> 366 

16 <212> PRT 

17 <213> Synthetic 
18 

19 <400> 263 
20 

21 Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu Ala 

22 1 5 10 15 
23 

24 Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu Lys Lys Pro 

25 20 25 30 
26 

27 Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg Phe 

28 35 40 45 
29 

30 Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala Ser 

31 50 55 60 
32 

33 Tyr Gly Pro Gly Gin Leu Gly Glu Met Val Thr Val Leu Ser He Asp 

34 6 5 7 0 7 5 8 0 
35 

36 Gly Gly Gly He Arg Gly He He Pro Ala Thr He Leu Glu Phe Leu 

37 85 90 95 
38 

39 Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala Asp Ala Arg Leu Ala 

40 100 105 110 
41 

42 Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly Leu Leu Thr 

43 115 1 20 1 25 
44 

45 Ala Met He Ser Thr Pro Asn Glu Asn Asn Arg Pro Phe Ala Ala Ala 

46 130 135 140 
47 

48 Lys Glu He Val Pro Phe Tyr Phe Glu His Gly Pro Gin He Phe Asn 

49 1 45 1 50 1 55 1 60 
50 

51 Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp Gly Lys Tyr Leu Met 

52 1 65 1 70 1 75 
53 

54 Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His Gin Ala Leu 

55 180 185 190 
56 

57 Thr Glu Val Val He Ser Ser Phe Asp He Lys Thr Asn Lys Pro Val 
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lie Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro Glu Leu Asp Ala Lys 
210 215 220 

Met Tyr Asp lie Ser Tyr Ser Thr Ala Ala Ala Pro Thr Tyr Phe Pro 
225 230 235 240 

Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly Asp Glu Tyr Glu Phe 
245 250 255 

Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala Asp Pro Ala Leu Leu 
260 265 270 

Ser lie Ser Val Ala Thr Arg Leu Ala Gin Lys Asp Pro Ala Phe Ala 
275 280 285 

Ser lie Arg Ser Leu Asn Tyr Lys Lys Met Leu Leu Leu Ser Leu Gly 
290 295 300 

Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr Thr Ala Lys Glu Ala 
305 310 315 320 

Ala Thr Trp Thr Ala Val His Trp Met Leu Val He Gin Lys Met Thr 
325 330 335 

Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser Thr Ala Phe 
340 345 350 

Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin Glu 
355 360 365 

<210> 264 

<211> 1158 

<212> DNA 

<213> Synthetic 

<400> 264 

atggccacca ccaagagctt cctcatcctg atcttcatga tcctggccac caccagcagc 60 

accttcgccc agctcggcga gatggtgacc gtgctctcca tcgacggcgg tggcatcagg 120 

ggcatcatcc cggccaccat cctggagttc ctggagggcc aactccagga gatggacaac 180 

aacgccgacg cccgcctggc cgactacttc gacgtgatcg gtggcaccag caccggcggt 240 

ctcctgaccg ccatgatctc cactccgaac gagaacaacc gccccttcgc cgctgcgaag 300 

gagatcgtcc cgttctactt cgaacacggc cctcagattt tcaacccctc gggtcaaatc 360 

ctgggcccca agtacgacgg caagtacctt atgcaagtgc ttcaggagaa gctgggcgag 420 

actagggtgc accaggcgct gaccgaggtc gtcatctcca gcttcgacat caagaccaac 480 

aagccagtca tcttcaccaa gtccaacctg gccaacagcc cggagctgga cgctaagatg 540 

tacgacatct cctactccac tgctgccgct cccacgtact tccctccgca ctacttcgtc 600 
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accaacacca 


gcaacggcga 


cgagtacgag 


ttcaaccttg 


ttgacggtgc 


ggtggctacg 


gtggcggacc 


cggcgctcct 


gtccatcagc 


gtcgccacgc 


gcctggccca 


gaaggatcca 


gccttcgcta 


gcattaggag 


cctcaactac 


aagaagatgc 


tgctgctcag 


cctgggcact 


ggcacgacct 


ccgagttcga 


caagacctac 


actgccaagg 


aggccgctac 


ctggaccgcc 


gtccattgga 


tgctggtcat 


ccagaagatg 


acggacgccg 


cttccagcta 


catgaccgac 


tactacctct 


ccactgcgtt 


ccaggcgctt 


gactccaaga 


acaactacct 


ccgtgttcag 


gagaatgccc 


tcactggcac 


cacgaccgag 


atggacgatg 


cctccgaggc 


caacatggag 


ctgctcgtcc 


aggtgggtga 


gaacctcctg 


aagaagcccg 


tctccgaaga 


caatcccgag 


acctatgagg 


aagcgctcaa 


gcgctttgcc 


aagctgctct 


ctgataggaa 


gaaactccgc 


gctaacaagg 


ccagctac 











<210> 265 

<211> 386 

<212> PRT 

<213> Synthetic 

<400> 265 

Met Ala Thr Thr Lys Ser Phe Leu lie Leu lie Phe Met lie Leu Ala 
15 10 15 

Thr Thr Ser Ser Thr Phe Ala Gin Leu Gly Glu Met Val Thr Val Leu 



Ser lie Asp Gly Gly Gly lie Arg Gly lie He Pro Ala Thr He Leu 
35 40 45 

Glu Phe Leu Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala Asp Ala 
50 55 60 

Arg Leu Ala Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly 
65 70 75 80 

Leu Leu Thr Ala Met He Ser Thr Pro Asn Glu Asn Asn Arg Pro Phe 
85 90 95 

Ala Ala Ala Lys Glu He Val Pro Phe Tyr Phe Glu His Gly Pro Gin 
100 105 110 

He Phe Asn Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp Gly Lys 
115 120 125 

Tyr Leu Met Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His 
130 135 140 

Gin Ala Leu Thr Glu Val Val He Ser Ser Phe Asp He Lys Thr Asn 
145 150 155 160 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1158 
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2 Lys Pro Val lie Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro Glu Leu 

3 165 170 175 
4 

5 Asp Ala Lys Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala Pro Thr 

6 180 185 190 
7 

8 Tyr Phe Pro Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly Asp Glu 

9 195 200 205 
10 

11 Tyr Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala Asp Pro 

12 210 215 220 
13 

14 Ala Leu Leu Ser He Ser Val Ala Thr Arg Leu Ala Gin Lys Asp Pro 

15 225 230 235 240 
16 

17 Ala Phe Ala Ser He Arg Ser Leu Asn Tyr Lys Lys Met Leu Leu Leu 

18 245 250 255 
19 

20 Ser Leu Gly Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr Thr Ala 

21 260 265 270 
22 

23 Lys Glu Ala Ala Thr Trp Thr Ala Val His Trp Met Leu Val He Gin 

24 2 7 5 2 8 0 2 8 5 
25 

26 Lys Met Thr Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser 

27 290 295 300 
28 

29 Thr Ala Phe Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin 

30 305 310 315 320 
31 

32 Glu Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu 

33 325 330 335 
34 

35 Ala Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu Lys Lys 

36 340 345 350 
37 

38 Pro Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg 

39 355 360 365 
40 

41 Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala 

42 3 7 0 3 7 5 3 8 0 
43 

44 Ser Tyr 

45 3 8 5 
46 

47 <210> 266 

48 <211> 55 

49 <212> DNA 

50 <213> Synthetic 
51 

52 <400> 266 

53 ggagctcgag aaaagagagg ctgaagctag cctcaactac aagaagatgc tgctg 
54 

55 <210> 267 

56 <211> 42 

57 <212> DNA 
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1 <213> Synthetic 
2 

3 <400> 267 

4 gccgagctgt cctggtccgt agctggcctt gttagcgcgg ag 42 
5 

6 <210> 268 

7 <211> 36 

8 <212> DMA 

9 <213> Synthetic 
10 

11 <400> 268 

12 gccagctacg gaccaggaca gctcggcgag atggtg 36 
13 

14 <210> 269 

15 <211> 39 

16 <212> DNA 

17 <213> Synthetic 
18 

19 <400> 269 

20 ggtctagagg aattctcatt acctaatgct agcgaaggc 39 
21 

22 <210> 270 

23 <211> 1167 

24 <212> DNA 

25 <213> Synthetic 
26 

27 <400> 270 

28 atggccacca ccaagagctt cctcatcctg atcttcatga tcctggccac caccagcagc 60 
29 

30 accttcgcca gcctcaacta caagaagatg ctgctgctca gcctgggcac tggcacgacc 120 
31 

32 tccgagttcg acaagaccta cactgccaag gaggccgcta cctggaccgc cgtccattgg 180 
33 

34 atgctggtca tccagaagat gacggacgcc gcttccagct acatgaccga ctactacctc 240 
35 

36 tccactgcgt tccaggcgct tgactccaag aacaactacc tccgtgttca ggagaatgcc 300 
37 

38 ctcactggca ccacgaccga gatggacgat gcctccgagg ccaacatgga gctgctcgtc 360 
39 

40 caggtgggtg agaacctcct gaagaagccc gtctccgaag acaatcccga gacctatgag 420 
41 

42 gaagcgctca agcgctttgc caagctgctc tctgatagga agaaactccg cgctaacaag 480 
43 

44 gccagctacg gaccaggaca gctcggcgag atggtgaccg tgctctccat cgacggcggt 540 
45 

46 ggcatcaggg gcatcatccc ggccaccatc ctggagttcc tggagggcca actccaggag 600 
47 

48 atggacaaca acgccgacgc ccgcctggcc gactacttcg acgtgatcgg tggcaccagc 660 
49 

50 accggcggtc tcctgaccgc catgatctcc actccgaacg agaacaaccg ccccttcgcc 720 
51 

52 gctgcgaagg agatcgtccc gttctacttc gaacacggcc ctcagatttt caacccctcg 780 
53 

54 ggtcaaatcc tgggccccaa gtacgacggc aagtacctta tgcaagtgct tcaggagaag 84 0 
55 

56 ctgggcgaga ctagggtgca ccaggcgctg accgaggtcg tcatctccag cttcgacatc 900 
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aagaccaaca agccagtcat cttcaccaag tccaacctgg ccaacagccc ggagctggac 960 

gctaagatgt acgacatctc ctactccact gctgccgctc ccacgtactt ccctccgcac 1020 

tacttcgtca ccaacaccag caacggcgac gagtacgagt tcaaccttgt tgacggtgcg 1080 

gtggctacgg tggcggaccc ggcgctcctg tccatcagcg tcgccacgcg cctggcccag 1140 

aaggatccag ccttcgctag cattagg 1167 

<210> 271 

<211> 389 

<212> PRT 

<213> Synthetic 

<400> 271 

Met Ala Thr Thr Lys Ser Phe Leu lie Leu lie Phe Met lie Leu Ala 
15 10 15 

Thr Thr Ser Ser Thr Phe Ala Ser Leu Asn Tyr Lys Lys Met Leu Leu 
20 25 30 

Leu Ser Leu Gly Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr Thr 



Ala Lys Glu Ala Ala Thr Trp Thr Ala Val His Trp Met Leu Val He 
50 55 60 

Gin Lys Met Thr Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu 
65 70 75 80 

Ser Thr Ala Phe Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val 



Gin Glu Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala Ser 
100 105 110 

Glu Ala Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu Lys 
115 120 125 

Lys Pro Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys 
130 135 140 

Arg Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys 
145 150 155 160 

Ala Ser Tyr Gly Pro Gly Gin Leu Gly Glu Met Val Thr Val Leu Ser 
165 170 175 

He Asp Gly Gly Gly He Arg Gly He He Pro Ala Thr He Leu Glu 
180 185 190 

Phe Leu Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala Asp Ala Arg 
195 200 205 

Leu Ala Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly Leu 
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1 210 215 220 

2 

3 Leu Thr Ala Met lie Ser Thr Pro Asn Glu Asn Asn Arg Pro Phe Ala 

4 225 230 235 240 

6 Ala Ala Lys Glu He Val Pro Phe Tyr Phe Glu His Gly Pro Gin He 

7 245 250 255 
8 

9 Phe Asn Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp Gly Lys Tyr 

10 260 265 270 
11 

12 Leu Met Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His Gin 

13 275 280 285 
14 

15 Ala Leu Thr Glu Val Val He Ser Ser Phe Asp He Lys Thr Asn Lys 

16 290 295 300 
17 

18 Pro Val He Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro Glu Leu Asp 

19 305 310 315 320 
20 

21 Ala Lys Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala Pro Thr Tyr 

22 3 2 5 3 3 0 3 3 5 
23 

24 Phe Pro Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly Asp Glu Tyr 

25 340 345 350 
26 

27 Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala Asp Pro Ala 

28 3 5 5 3 6 0 3 6 5 
29 

30 Leu Leu Ser He Ser Val Ala Thr Arg Leu Ala Gin Lys Asp Pro Ala 

31 370 375 380 
32 

33 Phe Ala Ser He Arg 

34 385 
35 

36 <210> 272 

37 <211> 55 

38 <212> DNA 

39 <213> Synthetic 
40 

41 <4Q0> 272 

42 ggagctcgag aaaagagagg ctgaagctac tgccaaggag gccgctacct ggacc 55 
43 

44 <210> 273 

45 <211> 39 

46 <212> DNA 

47 <213> Synthetic 
48 

49 <400> 273 

50 ggtctagagg aattctcatt acttgtcgaa ctcggaggt 39 
51 

52 <210> 274 

53 <211> 1167 

54 <212> DNA 

55 <213> Synthetic 
56 

57 <400> 274 
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1 atggccacca ccaagagctt cctcatcctg atcttcatga tcctggccac caccagcagc 60 
2 

3 accttcgcca cctacactgc caaggaggcc gctacctgga ccgccgtcca ttggatgctg 120 

5 gtcatccaga agatgacgga cgccgcttcc agctacatga ccgactacta cctctccact 180 
6 

7 gcgttccagg cgcttgactc caagaacaac tacctccgtg ttcaggagaa tgccctcact 240 
8 

9 ggcaccacga ccgagatgga cgatgcctcc gaggccaaca tggagctgct cgtccaggtg 300 
10 

II ggtgagaacc tcctgaagaa gcccgtctcc gaagacaatc ccgagaccta tgaggaagcg 360 
12 

13 ctcaagcgct ttgccaagct gctctctgat aggaagaaac tccgcgctaa caaggccagc 420 
14 

15 tacggaccag gacagctcgg cgagatggtg accgtgctct ccatcgacgg cggtggcatc 480 
16 

17 aggggcatca tcccggccac catcctggag ttcctggagg gccaactcca ggagatggac 540 

19 aacaacgccg acgcccgcct ggccgactac ttcgacgtga tcggtggcac cagcaccggc 600 
20 

21 ggtctcctga ccgccatgat ctccactccg aacgagaaca accgcccctt cgccgctgcg 660 
22 

23 aaggagatcg tcccgttcta cttcgaacac ggccctcaga ttttcaaccc ctcgggtcaa 720 
24 

25 atcctgggcc ccaagtacga cggcaagtac cttatgcaag tgcttcagga gaagctgggc 780 
26 

27 gagactaggg tgcaccaggc gctgaccgag gtcgtcatct ccagcttcga catcaagacc 840 
28 

29 aacaagccag tcatcttcac caagtccaac ctggccaaca gcccggagct ggacgctaag 900 
30 

31 atgtacgaca tctcctactc cactgctgcc gctcccacgt acttccctcc gcactacttc 960 
32 

33 gtcaccaaca ccagcaacgg cgacgagtac gagttcaacc ttgttgacgg tgcggtggct 1020 
34 

35 acggtggcgg acccggcgct cctgtccatc agcgtcgcca cgcgcctggc ccagaaggat 1080 
36 

37 ccagccttcg ctagcattag gagcctcaac tacaagaaga tgctgctgct cagcctgggc 1140 
38 

39 actggcacga cctccgagtt cgacaag 1167 
40 

41 <210> 275 

42 <211> 389 

43 <212> PRT 

44 <213> Synthetic 
45 

46 <400> 275 
47 

48 Met Ala Thr Thr Lys Ser Phe Leu lie Leu He Phe Met He Leu Ala 

49 1 5 10 15 
50 

51 Thr Thr Ser Ser Thr Phe Ala Thr Tyr Thr Ala Lys Glu Ala Ala Thr 

52 20 25 30 
53 

54 Trp Thr Ala Val His Trp Met Leu Val He Gin Lys Met Thr Asp Ala 

55 35 40 45 
56 

57 Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser Thr Ala Phe Gin Ala 
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Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin Glu Asn Ala Leu Thr 
65 70 75 80 



Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu Ala Asn Met Glu Leu 
85 90 95 



Leu Val Gin Val Gly Glu Asn Leu Leu Lys Lys Pro Val Ser Glu Asp 
100 105 110 



Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg Phe Ala Lys Leu Leu 
115 120 125 



Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala Ser Tyr Gly Pro Gly 
130 135 140 



Gin Leu Gly Glu Met Val Thr Val Leu Ser He Asp Gly Gly Gly He 
145 150 155 160 



Arg Gly He He Pro Ala Thr He Leu Glu Phe Leu Glu Gly Gin Leu 

165 170 175 



Gin Glu Met Asp Asn Asn Ala Asp Ala Arg Leu Ala Asp Tyr Phe Asp 
180 185 190 



Val He Gly Gly Thr Ser Thr Gly Gly Leu Leu Thr Ala Met He Se, 
195 200 205 



Thr Pro Asn Glu Asn Asn Arg Pro Phe Ala Ala Ala Lys Glu He Val 
210 215 220 



Pro Phe Tyr Phe Glu His Gly Pro Gin He Phe Asn Pro Ser Gly Gin 
225 230 235 240 



He Leu Gly Pro Lys Tyr Asp Gly Lys Tyr Leu Met Gin Val Leu Gin 
245 250 255 



Glu Lys Leu Gly Glu Thr Arg Val His Gin Ala Leu Thr Glu Val Val 
260 265 270 



He Ser Ser Phe Asp He Lys Thr Asn Lys Pro Val He Phe Thr Lys 
275 280 285 



Ser Asn Leu Ala Asn Ser Pro Glu Leu Asp Ala Lys Met Tyr Asp He 
290 295 300 



Ser Tyr Ser Thr Ala Ala Ala Pro Thr Tyr Phe Pro Pro His Tyr Phe 
305 310 315 320 



Val Thr Asn Thr Ser Asn Gly Asp Glu Tyr Glu Phe Asn Leu Val Asp 
325 330 335 



Gly Ala Val Ala Thr Val Ala Asp Pro Ala Leu Leu Ser He Ser Val 
340 345 350 



57 Ala Thr Arg Leu Ala Gin Lys Asp Pro Ala Phe Ala Ser He Arg Ser 
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1 355 360 365 

2 

3 Leu Asn Tyr Lys Lys Met Leu Leu Leu Ser Leu Gly Thr Gly Thr Thr 

4 370 375 380 
5 

6 Ser Glu Phe Asp Lys 

7 385 
8 

9 <210> 276 

10 <211> 7 

11 <212> PRT 

12 <213> Synthetic 
13 

14 <400> 276 
15 

16 Gly Gly Gly Ser Gly Gly Gly 

17 1 5 
18 

19 <210> 277 

20 <211> 3 

21 <212> PRT 

22 <213> Synthetic 
23 

24 <400> 277 
25 

26 Gly Pro Gly 

27 1 
28 

29 <210> 278 

30 <211> 386 

31 <212> PRT 

32 <213> Solanum tuberosum 
33 

34 <400> 278 
35 

36 Met Ala Thr Thr Lys 

37 1 5 
38 

39 Thr Thr Ser Ser Thr 

40 20 
41 

42 Ser He Asp Gly Gly 

43 35 

45 Glu Phe Leu Glu Gly 

46 50 
47 

48 Arg Leu Ala Asp Tyr 

49 65 
50 

51 Leu Leu Thr Ala Met 

52 85 
53 

54 Ala Ala Ala Lys Asp 

55 100 
56 

57 He Phe Asn Tyr Ser 



Ser Phe Leu He Leu Phe Phe Met He Leu Ala 
10 15 

Cys Ala Lys Leu Glu Glu Met Val Thr Val Leu 
25 30 

Gly He Lys Gly He He Pro Ala He He Leu 
40 45 

Gin Leu Gin Glu Val Asp Asn Asn Lys Asp Ala 
55 60 

Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly 
70 75 80 

He Thr Thr Pro Asn Glu Asn Asn Arg Pro Phe 
90 95 

He Val Pro Phe Tyr Phe Glu His Gly Pro His 
105 110 

Gly Ser He He Gly Pro Met Tyr Asp Gly Lys 
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3 Tyr Leu Leu Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His 

4 130 135 140 

6 Gin Ala Leu Thr Glu Val Ala lie Ser Ser Phe Asp He Lys Thr Asn 

7 145 150 155 160 

9 Lys Pro Val He Phe Thr Lys Ser Asn Leu Ala Lys Ser Pro Glu Leu 

165 170 175 



12 


Asp 


Ala 


Lys 


14 


Tyr 


Phe 


Pro 


16 






195 


17 
18 


Tyr 


Glu 


Phe 


19 




210 




20 








21 


Ala 


Leu 


Leu 


22 


225 






23 










Ala 


Phe 


Ser 


23 








26 








27 


Ser 


Leu 


Gly 


28 








29 








30 


Gin 


Glu 


Ala 


31 






275 


32 








33 


n 




Thr 






290 




35 








36 


Thr 


Val 


Phe 


37 


305 














39 


Glu 


Asn 


Ala 


40 








41 








42 


Ala 


Asn 


Met 


43 








44 








45 


Pro 


Val 


Ser 


46 






355 


47 








48 


Phe 


Ala 


Lys 


49 




370 




50 








51 


Ser 


Tyr 




52 


385 






53 








54 


<210> 279 


55 


<211> 


386 


56 


<212> 


=RT 


57 


<213> Solar 
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2 <400> 279 

4 Met Ala Thr Thr Lys Ser Val Leu Val Leu Phe Phe Met lie Leu Ala 

5 1 5 10 15 
6 

7 Thr Thr Ser Ser Thr Cys Ala Thr Leu Gly Glu Met Val Thr Val Leu 



10 Ser He Asp Gly Gly Gly He Lys Gly He He Pro Ala Thr He Leu 

11 35 40 45 
12 

13 Glu Phe Leu Glu Gly Gin Leu Gin Glu Val Asp Asn Asn Lys Asp Ala 

14 50 55 60 
15 

16 Arg Leu Ala Asp Tyr Phe Asp Val He Gly Gly Thr Ser Thr Gly Gly 

17 65 70 75 80 
18 

19 Leu Leu Thr Ala Met He Thr Thr Pro Asn Glu Asn Asn Arg Pro Phe 

20 8 5 9 0 9 5 
21 

22 Ala Ala Ala Lys Asp He Val Pro Phe Tyr Phe Glu His Gly Pro His 

23 100 105 110 
24 

25 He Phe Asn Ser Ser Gly Ser He Phe Gly Pro Met Tyr Asp Gly Lys 

26 115 1 20 1 25 
27 

28 Tyr Phe Leu Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His 

29 1 30 1 35 1 40 
30 

31 Gin Ala Leu Thr Glu Val Ala He Ser Ser Phe Asp He Lys Thr Asn 

32 145 150 155 160 
33 

34 Lys Pro Val He Phe Thr Lys Ser Asn Leu Ala Lys Ser Pro Glu Leu 

35 165 170 175 
36 

37 Asp Ala Lys Met Asn Asp He Cys Tyr Ser Thr Ala Ala Ala Pro Thr 

38 180 185 190 
39 

40 Tyr Phe Pro Pro His Tyr Phe Val Thr His Thr Ser Asn Gly Asp Lys 

41 195 ' 200 205 
42 

43 Tyr Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Gly Asp Pro 

44 210 215 220 
45 

46 Ala Leu Leu Ser Leu Ser Val Arg Thr Lys Leu Ala Gin Val Asp Pro 

47 2 2 5 2 3 0 2 3 5 2 4 0 
48 

49 Lys Phe Ala Ser He Lys Ser Leu Asn Tyr Asn Glu Met Leu Leu Leu 

50 245 250 255 
51 

52 Ser Leu Gly Thr Gly Thr Asn Ser Glu Phe Asp Lys Thr Tyr Thr Ala 

53 2 6 0 2 6 5 2 7 0 
54 

55 Glu Glu Ala Ala Lys Trp Gly Pro Leu Arg Trp He Leu Ala He Gin 

56 2 7 5 2 8 0 2 8 5 
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1 Gin Met Thr Asn Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser 

2 290 295 300 





Thr 


Val 


Phe 


Gin 


Ala 






305 










310 




Glu 


Asn 


Ala 


Leu 


Thr 


Gly 












325 


















10 


Ala 


Asn 


Met 


Glu 


Leu 




" 








340 






12 














13 


Pro 


Val 


Ser 


Lys 


Asp 


Ser 


14 






355 






















16 


Phe 


Ala 


Lys 


Leu 


Leu 


Ser . 






370 










18 














19 


Ser 


Tyr 










20 


385 












21 














22 


<210> ; 


280 








23 


<211> : 


365 








24 


<212> ] 


PRT 








25 


<213> : 


Solanum tuberosum 


26 














27 


<400> : 


280 








28 














29 


Met 


Ala 


Leu 


Glu 


Glu 


Met 


30 


1 








5 




31 














32 


He 


Lys 


Gly 


He 


He 


Pro 


33 








20 






34 














35 


Leu 


Gin 


Lys 


Met 


Asp 


Asn 


36 






35 








37 














38 


Asp 


Val 


He 


Gly 


Gly 


Thr 


39 




50 










40 














41 


Thr 


Thr 


Pro 


Asn 


Glu 


Asn 


42 


65 










70 


43 














44 


Val 


Pro 


Phe 


Tyr 


Phe 


Glu 


43 










85 




46 


Trp 


Pro 


He 


Phe 


Trp 


Pro 


48 








100 






49 














50 


Leu 


Gin 


Glu 


Lys 


Leu 


Gly 


5! 






115 








52 














53 


Val 


Ala 


He 


Ser 


Ser 


Phe 


54 




130 










55 














56 


Thr 


Lys 


Ser 


Asn 


Leu 


Ala 


57 


145 










150 
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2 


Asp lie Cys 


Tyr 


Ser 


3 






165 


4 
5 


Tyr Phe Ala 


Thr 


Asn 


6 




180 




7 








8 


Val Asp Gly 


Ala 


Val 


9 


195 






10 








11 


Ser Val Ala 


Thr 


Arg 


12 


210 






13 








14 


Arg Ser Leu 


Asn 


Tyr 


15 


225 






16 








17 


Thr Thr Ser 


Glu 


Phe 


18 






245 


19 








20 


Trp Gly Ala 


Leu 


Gin 


21 




260 




22 








23 


Ala Ser Ser 


Tyr 


Met 


24 


275 






25 








26 


Leu His Ser 


Gin 


Asn 


27 


290 






28 








29 


Gly Thr Thr 


Thr 


Lys 


30 


305 






31 








32 


Leu Ala Gin 


Val 


Gly 


33 






325 


34 








35 


Asn Pro Glu 


Thr 


Tyr 


36 




340 




37 








38 


Ser Asp Arg 


Lys 


Lys 


39 


355 






40 








41 


<210> 281 






42 


<211> 364 






43 


<212> PRT 






44 


<213> Solanum i 


tubei 


45 








46 


<400> 281 






47 








48 


Pro Trp Leu 


Glu 


Glu 


49 


1 




5 


50 








51 


He Lys Gly 


He 


He 


52 




20 




53 








54 


Leu Gin Glu 


Val 


Asp 


55 


35 






56 








57 


Asp Val He 


Gly 


Gly 
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Thr Thr Pro Asn Glu Asn Asn Arg Pro Phe Ala Ala Ala Lys Asp He 
65 70 75 80 



Val Pro Phe Tyr Phe Glu His Gly Pro His He Phe Asn Tyr Ser Gly 
85 90 95 



Ser He Leu Gly Pro Met Tyr Asp Gly Lys Tyr Leu Leu Gin Val Leu 
100 105 110 



Gin Glu Lys Leu Gly Glu Thr Arg Val His Gin Ala Leu Thr Glu Val 
115 120 125 



Ala He Ser Ser Phe Asp He Lys Thr Asn Lys Pro Val He Phe Thr 
130 135 140 



Lys Ser Asn Leu Ala Lys Ser Pro Glu Leu Asp Ala Lys Met Tyr Asp 
145 150 155 160 



He Cys Tyr Ser Thr Ala Ala Ala Pro He Tyr Phe Pro Pro His His 
165 170 175 



Phe Val Thr His Thr Ser Asn Gly Ala Arg Tyr Glu Phe Asn Leu Val 
180 185 190 



Asp Gly Ala Val Ala Thr Val Gly Asp Pro Ala Leu Leu Ser Leu Ser 
195 200 205 



Val Ala Thr Arg Leu Ala Gin Glu Asp Pro Ala Phe Ser Ser He Lys 
210 215 220 



Ser Leu Asp Tyr Lys Gin Met Leu Leu Leu Ser Leu Gly Thr Gly Thr 
225 230 235 240 



Asn Ser Glu Phe Asp Lys Thr Tyr Thr Ala Glu Glu Ala Ala Lys Trp 
245 250 255 



Gly Pro Leu Arg Trp Met Leu Ala He Gin Gin Met Thr Asn Ala Ala 
260 265 270 



Ser Phe Tyr Met Thr Asp Tyr Tyr He Ser Thr Val Phe Gin Ala Arg 
275 280 285 



His Ser Gin Asn Asn Tyr Leu Arg Val Gin Glu Asn Ala Leu Asn Gly 
290 295 300 



Thr Thr Thr Glu Met Asp Asp Ala Ser Glu Ala Asn Met Glu Leu Leu 
305 310 315 320 



Val Gin Val Gly Glu Thr Leu Leu Lys Lys Pro Val Ser Arg Asp Ser 
325 330 335 



Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg Phe Ala Lys Leu Leu Ser 
340 345 350 



57 Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala Ser Tyr 
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1 






355 


2 
3 


<210> 282 


4 


<211> 386 


5 


<212> PRT 


6 


<213> Solar 


7 
8 


<400> 282 


9 








10 


Met 


Ala 


Thr 


11 


1 






12 








13 


Thr 


Thr 


Ser 


14 








15 








16 


Ser 


lie 


Asp 


17 






35 


18 








19 


Glu 


Phe 


Leu 


20 




50 




21 








22 




Leu 


Ala 


23 








24 








25 


Leu 


Leu 


Thr 


26 








27 








28 


Ala 


Ala 


Ala 


29 








30 








31 


lie 


Phe 


Asn 


32 






115 


33 








34 


T 


Leu 


Leu 


35 




130 




36 








37 


Gin 


Ala 


Leu 


38 


145 






39 








40 


Lvs 


Pro 


Val 


41 








42 








43 


Asp 


Ala 


Lys 


44 








45 








46 


Tyr 


Phe 


Pro 


47 






195 


48 








49 


Tyr 


Glu 


Phe 


50 




210 




51 








52 


Ala 


Leu 


Leu 


53 


225 






54 








55 


Ala 


Phe 


Ser 


56 








57 
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Ser Leu Gly Thr Gly Thr Asn Ser Glu Phe Asp Lys Thr Tyr Thr Ala 
260 265 270 

Glu Glu Ala Ala Lys Trp Gly Pro Leu Arg Trp Met Leu Ala He Gin 
275 280 285 

Gin Met Thr Asn Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr He Ser 
290 295 300 

Thr Val Phe Gin Ala Arg His Ser Gin Asn Asn Tyr Leu Arg Val Gin 
305 310 315 320 

Glu Asn Ala Leu Asn Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu 
325 330 335 

Ala Asn Met Glu Leu Leu Val Gin Val Gly Ala Thr Leu Leu Lys Lys 
340 345 350 

Pro Val Ser Lys Asp Ser Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg 
355 360 365 

Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala 
370 375 380 

Ser Tyr 
385 

<210> 283 
<211> 10 
<212> PRT 
<213> Synthetic 

<400> 283 

Ala Phe Phe Asp Lys Thr Tyr Thr Ala Lys 
15 10 

<210> 284 

<211> 10 

<212> PRT 

<213> Synthetic 

<400> 284 

Cys He Phe Asp Ser Thr Tyr Thr Ala Lys 
15 10 

<210> 285 
<211> 1161 
<212> DNA 

<213> Solanum tuberosum 
<400> 285 

atggcaacta ctaaatcttt tttaatttta atatttatga tattagcaac tactagttca 
acatttgctc agttgggaga aatggtgact gttcttagta ttgatggagg tggaattaga 
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gggatcattc 


cggctaccat 


tctcgaattt 


cttgaaggac 


aacttcagga 


aatggacaat 


180 


aatgcagatg 


caagacttgc 


agattacttt 


gatgtaattg 


gaggaacaag 


tacaggaggt 


240 


ttattgactg 


ctatgataag 


tactccaaat 


gaaaacaatc 


gaccctttgc 


tgctgccaaa 


300 


gaaattgtac 


ctttttactt 


cgaacatggc 


cctcagattt 


ttaatcctag 


tggtcaaatt 


360 


ttaggcccaa 


aatatgatgg 


aaaatatctt 


atgcaagttc 


ttcaagaaaa 


acttggagaa 


420 


actcgtgtgc 


atcaagcttt 


gacagaagtt 


gtcatctcaa 


gctttgacat 


caaaacaaat 


480 


aagccagtaa 


tattcactaa 


gtcaaattta 


gcaaactctc 


cagaattgga 


tgctaagatg 


540 


tatgacataa 


gttattccac 


agcagcagct 


ccaacatatt 


ttcctccgca 


ttactttgtt 


600 


actaatacta 


gtaatggaga 


tgaatatgag 


ttcaatcttg 


ttgatggtgc 


tgttgctact 


660 


gttgctgatc 


cggcgttatt 


atccattagc 


gttgcaacga 


gacttgcaca 


aaaggatcca 


720 


gcatttgctt 


caattaggtc 


attgaattac 


aaaaaaatgc 


tgttgctctc 


attaggcact 


780 


ggcactactt 


cagagtttga 


taaaacatat 


acagcaaaag 


aggcagctac 


ctggactgct 


840 


gtacattgga 


tgttagttat 


acagaaaatg 


actgatgcag 


caagttctta 


catgactgat 


900 


tattaccttt 


ctactgcttt 


tcaagctctt 


gattcaaaaa 


acaattacct 


cagggttcaa 


960 


gaaaatgcat 


taacaggcac 


aactactgaa 


atggatgatg 


cttctgaggc 


taatatggaa 


1020 


ttattagtac 


aagttggtga 


aaacttattg 


aagaaaccag 


tttccgaaga 


caatcctgaa 


1080 


acctatgagg 


aagctctaaa 


gaggtttgca 


aaattgctct 


ctgataggaa 


gaaactccga 


1140 


gcaaacaaag 


cttcttatta 


a 








1161 



<210> 286 
<211> 386 
<212> PRT 

<213> Solanum tuberosum 
<400> 286 

Met Ala Thr Thr Lys Ser Phe Leu lie Leu lie Phe Met lie Leu Ala 



Thr Thr Ser Ser Thr Phe Ala Gin Leu Gly Glu Met Val Thr Val Leu 
20 25 30 

Ser lie Asp Gly Gly Gly lie Arg Gly lie lie Pro Ala Thr lie Leu 
35 40 45 

Glu Phe Leu Glu Gly Gin Leu Gin Glu Met Asp Asn Asn Ala Asp Ala 



Arg Leu Ala Asp Tyr Phe Asp Val lie Gly Gly Thr Ser Thr Gly Gly 
65 70 75 80 
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Leu Leu Thr Ala Met lie Ser Thr Pro Asn Glu Asn Asn Arg Pro Phe 
85 90 95 



Ala Ala Ala Lys Glu He Val Pro Phe Tyr Phe Glu His Gly Pro Gin 
100 105 110 



He Phe Asn Pro Ser Gly Gin He Leu Gly Pro Lys Tyr Asp Gly Lys 
115 120 125 



Tyr Leu Met Gin Val Leu Gin Glu Lys Leu Gly Glu Thr Arg Val His 
130 135 140 



Gin Ala Leu Thr Glu Val Val He Ser Ser Phe Asp He Lys Thr Asn 
145 150 155 160 



Lys Pro Val He Phe Thr Lys Ser Asn Leu Ala Asn Ser Pro Glu Leu 
165 170 175 



Asp Ala Lys Met Tyr Asp He Ser Tyr Ser Thr Ala Ala Ala Pro Thr 
180 185 190 



Tyr Phe Pro Pro His Tyr Phe Val Thr Asn Thr Ser Asn Gly Asp Glu 
195 200 205 



Tyr Glu Phe Asn Leu Val Asp Gly Ala Val Ala Thr Val Ala Asp Pro 
210 215 220 



Ala Leu Leu Ser He Ser Val Ala Thr Arg Leu Ala Gin Lys Asp Pro 
225 230 235 240 



Ala Phe Ala Ser He Arg Ser Leu Asn Tyr Lys Lys Met Leu Leu Leu 
245 250 255 



Ser Leu Gly Thr Gly Thr Thr Ser Glu Phe Asp Lys Thr Tyr Thr Ala 
260 265 270 



Lys Glu Ala Ala Thr Trp Thr Ala Val His Trp Met Leu Val He Gin 
275 280 285 



Lys Met Thr Asp Ala Ala Ser Ser Tyr Met Thr Asp Tyr Tyr Leu Ser 
290 295 300 



Thr Ala Phe Gin Ala Leu Asp Ser Lys Asn Asn Tyr Leu Arg Val Gin 
305 310 315 320 



Glu Asn Ala Leu Thr Gly Thr Thr Thr Glu Met Asp Asp Ala Ser Glu 
325 330 335 



Ala Asn Met Glu Leu Leu Val Gin Val Gly Glu Asn Leu Leu Lys Lys 
340 345 350 



Pro Val Ser Glu Asp Asn Pro Glu Thr Tyr Glu Glu Ala Leu Lys Arg 
355 360 365 



Phe Ala Lys Leu Leu Ser Asp Arg Lys Lys Leu Arg Ala Asn Lys Ala 
370 375 380 
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2 


Ser 


Tyr 




















3 
4 


385 






















5 


<210> 2 


87 


















6 
7 


<211> 408 
<212> PRT 


















S 


<213> W 


[098154327 12/3/S 


»8 {Equinous 


(Genus) 


9 


macrolota) 


















10 
























11 


<400> 2 


:87 


















12 
























13 


Met 


Lys 


Ser 


Lys 


Met 


Ala 


Met 


Leu 


Leu 


Leu 


Leu 


14 


1 








5 










10 




15 
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2 Leu Thr Thr Phe Tyr Leu Glu Asn Gly Pro Lys He Phe Pro Gin Lys 

3 100 105 110 
4 

5 Lys Ala Gly Leu Leu Thr Pro Leu Arg Asn Leu Leu Gly Leu Val Arg 

6 115 120 125 
7 

■8 Gly Pro Lys Tyr Asp Gly Val Phe Leu His Asp Lys He Lys Ser Leu 
9 130 135 140 

10 

11 Thr His Asp Val Arg Val Ala Asp Thr Val Thr Asn Val He Val Pro 

12 145 150 155 160 
13 

14 Ala Phe Asp Val Lys Tyr Leu Gin Pro He He Phe Ser Thr Tyr Glu 

15 165 170 175 
16 

17 Ala Lys Thr Asp Thr Leu Lys Asn Ala His Leu Ser Asp He Cys He 

18 180 185 190 
19 

20 Ser Thr Ser Ala Ala Pro Thr Tyr Phe Pro Ala His Phe Phe Lys Thr 

21 195 200 205 
22 

23 Glu Ala Thr Asp Gly Arg Pro Pro Arg Glu Tyr His Leu Val Asp Gly 

24 210 215 220 
25 

26 Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Ser Met Leu Thr 

27 2 2 5 2 3 0 2 3 5 2 4 0 
28 

29 Lys Glu Val His Arg Arg Asn Pro Asn Phe Asn Ala Gly Ser Pro Thr 

30 245 250 255 
31 

32 Glu Tyr Thr Asn Tyr Leu He He Ser Val Gly Thr Gly Ser Ala Lys 

33 2 6 0 2 6 5 2 7 0 
34 

35 Gin Ala Glu Lys Tyr Thr Ala Glu Gin Cys Ala Lys Trp Gly Leu He 

36 2 7 5 2 8 0 2 8 5 
37 

38 Gin Trp Leu Tyr Asn Gly Gly Phe Thr Pro He He Asp He Phe Ser 

39 290 295 300 
40 

41 His Ala Ser Ser Asp Met Val Asp He His Ala Ser He Leu Phe Gin 

42 305 310 315 320 
43 

44 Ala Leu His Cys Glu Lys Lys Tyr Leu Arg He Gin Asp Asp Thr Leu 

45 325 330 335 
46 

47 Thr Gly Asn Ala Ser Ser Val Asp He Ala Thr Lys Glu Asn Met Glu 

48 340 345 350 
49 

50 Ser Leu He Ser He Gly Gin Glu Leu Leu Lys Lys Pro Val Ala Arg 

51 355 360 365 
52 

53 Val Asn He Asp Thr Gly Val Tyr Glu Ser Cys Asp Gly Glu Gly Thr 

54 370 375 380 

55 

56 Asn Ala Gin Ser Leu Ala Asp Phe Ala Lys Gin Leu Ser Asp Glu Arg 

57 3 8 5 3 9 0 3 9 5 4 0 0 
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Leu Val Asp Gly Gly Val Ala Ala Asn Asn Pro Thr Met 
260 265 



Val Ala Met 
270 



Ser Met Leu Thr Lys Glu Val His Arg Arg Asn Pro Asn Phe Asn Ala 
275 280 285 



Gly Ser Pro Thr Glu Tyr Thr Asn Tyr Leu lie lie Ser Val Gly Thr 
290 295 300 



Gly Ser Ala Lys Gin Ala Glu Lys Tyr Thr Ala Glu Gin 
305 310 315 



Cys Ala Lys 
320 



Trp Gly Leu lie Gin Trp Leu Tyr Asn Gly Gly Phe Thr 
325 330 



Pro He He 
335 



Asp He Phe Ser His Ala Ser Ser Asp Met Val Asp He 
340 345 



His Ala Ser 
350 



He Leu Phe Gin Ala Leu His Cys Glu Lys Lys Tyr Leu 
355 360 365 



Leu Tyr Tyr Ala Gly Tyr Phe Asp Trp Glu Arg He Val 
370 375 380 



Arg He Gin 
Arg Gly His 



Arg His Gin Gly Glu His Gly Val Ser Asp He Asp Arg 
385 390 395 



Pro Gly Ala 
400 



Ala Gin Glu Ala Ser Gly Glu Ser Glu His Arg His Arg 
405 410 



Ala Val Arg 
415 



Val Leu Arg Arg Gly His Lys Cys Thr Val Ala Ser Leu Arg Gin Ala 
420 425 430 



Thr Leu Arg Ala Gin Ala Thr Gin Glu Gin Ser Gin Leu Gin Leu He 
435 440 445 



Asn Thr Ser Leu Ser His Ser Met Cys Ser Phe Arg Arg Phe Thr Val 
450 455 460 



Ser Tyr Phe Phe Asn Phe Asn Ser Val Cys Val Leu Cys Val Leu Cys 
465 470 475 480 



Val Tyr Gin Thr Phe Lys Phe Asn Gin Lys Lys Lys Lys 
485 490 



Lys Lys Lys 
495 



Lys Lys Lys Lys Lys Lys Lys Lys Lys Arg Ala Ala 
500 505 



51 <210> 290 

52 <211> 410 

53 <212> PRT 

54 <213> Zea mays 
55 

56 <400> 290 
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Met Gly Ser He Gly Arg Gly Thr Ala Asn Cys Ala Thr Val Pro Gin 
15 10 15 



Pro Pro Pro Ser Thr Gly Lys Leu He Thr He Leu Ser He Asp Gly 
20 25 30 



Gly Gly He Arg Gly Leu He Pro Ala Thr He He Ala Tyr Leu Glu 
35 40 45 



Ala Lys Leu Gin Glu Leu Asp Gly Pro Asp Ala Arg He Ala Asp Tyr 
50 55 60 



Phe Asp Val He Ala Gly Thr Ser Thr Gly Ala Leu Leu Ala Ser Met 
65 70 75 80 



16 Leu Ala Ala Pro Asp Glu Asn Asn Arg Pro Leu Phe Ala Ala Lys Asp 



Leu Thr Thr Phe Tyr Leu Glu Asn Gly Pro Lys He Phe Pro Gin Lys 
100 105 110 



Lys Ala Gly Leu Leu Thr Pro Leu Arg Asn Leu Leu Gly Leu Val Arg 
115 120 125 



Gly Pro Lys Tyr Asp Gly Val Phe Leu His Asp Lys He Lys Ser Leu 
130 135 140 



Thr His Asp Val Arg Val Ala Asp Thr Val Thr Asn Val He Val Pro 
"145 150 155 160 



Ala Phe Asp Val Lys Ser Leu Gin Pro He He Phe Ser Thr Tyr Glu 
165 170 175 



Ala Lys Thr Asp Thr Leu Lys Asn Ala His Leu Ser Asp He Cys He 
180 185 190 



Ser Thr Ser Ala Ala Pro Thr Tyr Phe Pro Ala His Phe Phe Lys Thr 
195 200 205 



Glu Ala Thr Asp Gly Arg Pro Pro Arg Glu Tyr His Leu Val Asp Gly 
210 215 220 



Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Ser Met Leu Thr 
225 230 235 240 



Lys Glu Val His Arg Arg Asn Pro Asn Phe Asn Ala Gly Ser Pro Thr 
245 250 255 



Glu Tyr Thr Asn Tyr Leu He He Ser Val Gly Thr Gly Ser Ala Lys 
260 265 270 



Gin Ala Glu Lys Tyr Thr Ala Glu Gin Cys Ala Lys Trp Gly Leu He 
275 280 285 



Gin Trp Leu Tyr Asn Gly Gly Phe Thr Pro He He Asp He Phe Ser 
290 295 300 
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15 10 15 

Pro Pro Pro Ser Thr Gly Lys Leu lie Thr lie Leu Ser lie Asp Gly 



Gly Gly He Arg Gly Leu He Pro Ala Thr He He Ala Tyr Leu Glu 
35 40 45 



Ala Lys Leu Gin Glu Leu Asp Gly Pro Asp Ala Arg He Ala Asp Tyr 
50 55 60 

Phe Asp Val He Ala Gly Thr Ser Thr Gly Ala Leu Leu Ala Ser Met 



70 



75 



Leu Ala Ala Pro Asp Glu Asn Asn Arg Pro Leu Phe Ala Ala Lys Asp 
85 90 95 



Leu Thr Thr Phe Tyr Leu Glu Asn Gly Pro Lys He Phe Pro Gin Lys 
100 105 110 



Lys Ala Gly Leu Leu Thr Pro Leu Arg Asn Leu Leu Gly Leu Val Arg 
115 120 125 



Gly Pro Lys Tyr Asp Gly Val Phe Leu His Asp Lys He Lys Ser Leu 
130 135 140 



Thr His Asp Val Arg Val Ala Asp Thr Val Thr Asn Val He Val Pro 
145 150 155 160 



Ala Phe Asp Val Lys Ser Leu Gin Pro He He Phe Ser Thr Tyr Glu 
165 170 175 



Ala Lys Thr Asp Thr Leu Lys Asn Ala His Leu Ser Asp He Cys He 
180 185 190 



Ser Thr Ser Ala Ala Pro Thr Tyr Phe Pro Ala His Phe Phe Lys He 
195 200 205 



Glu Ala Thr Asp Gly Arg Pro Pro Arg Glu Tyr His Leu Val Asp Gly 
210 215 220 



Gly Val Ala Ala Asn Asn Pro Thr Met Val Ala Met Ser Met Leu Thr 
225 230 235 240 



Lys Glu Val His Arg Arg Asn Pro Asn Phe Asn Ala Gly Ser Pro Thr 
245 250 255 



Glu Tyr Thr Asn Tyr Leu He He Ser Val Gly Thr Gly Ser Ala Lys 
260 265 270 



Gin Ala Glu Lys Tyr Thr Ala Glu Gin Cys Ala Lys Trp Gly Leu He 
275 280 285 



Gin Trp Leu Tyr Asn Gly Gly Phe Thr Pro He He Asp He Phe Ser 
290 295 300 



57 His Ala Ser Ser Asp Met Val Asp He His Ala Ser He Leu Phe Gin 
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